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ON (54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER 
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£J (57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, particularly lung cancer, are disclosed. Illustra- 
tive compositions comprise one or more lung tumor polypeptides, immunogenic portions thereof, polynucleotides that encode such 
polypeptides, antigen presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such polypep- 
tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, particularly lung 
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COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LUNG CANCER 



5 TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
cancer, such as lung cancer. The invention is more specifically related to polypeptides, 
comprising at least a portion of a lung tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides are useful in pharmaceutical 
10 compositions, e.g., vaccines, and other compositions for the diagnosis and treatment of 
lung cancer. 

BACKGROUND OF THE INVENTION 

Field of the invention 

Lung cancer is the primary cause of cancer death among both men and 
15 women in the U.S., with an estimated 172,000 new cases being reported in 1994. The 
five-year survival rate among all lung cancer patients, regardless of the stage of disease 
at diagnosis, is only 13%. This contrasts with a five-year survival rate of 46% among 
cases detected while the disease is still localized. However, only 16% of lung cancers 
are discovered before the disease has spread. 

20 Description of Related Art 

Early detection is difficult since clinical symptoms are often not seen 
until the disease has reached an advanced stage. Currently, diagnosis is aided by the use 
of chest x-rays, analysis of the type of cells contained in sputum and fiberoptic 
examination of the bronchial passages. Treatment regimens are determined by the type 

25 and stage of the cancer, and include surgery, radiation therapy and/or chemotherapy. In 
spite of considerable research into therapies for the disease, lung cancer remains 
difficult to treat. 

Accordingly, there remains a need in the art for improved vaccines, 
treatment methods and diagnostic techniques for lung cancer. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO: 217-390, 392, 394, 396, 398- 
5 420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 741, 744 and 746; 

(b) complements of the sequences provided in SEQ ID NO: 217-390, 
392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 741, 744 
and 746; 

(c) sequences consisting of at least 20 contiguous residues of a 
10 sequence provided in SEQ ID NO: 217-390, 392, 394, 396, 398-420 422-424, 428-433, 

440-583, 588-732, 736, 737, 740, 741, 744 and 746; 

(d) sequences that hybridize to a sequence provided in SEQ ID NO: 
217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 
741, 744 and 746, under moderately stringent conditions; 

15 (e) sequences having at least 75% identity to a sequence of SEQ ID 

NO: 217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 
740, 741, 744 and 746; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NO: 217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 

20 740, 741, 744 and 746; and 

(g) degenerate variants of a sequence provided in SEQ ID NO: 217- 
390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 741, 
744 and 746. 

In one preferred embodiment, the polynucleotide compositions of the 
25 invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of lung tumors samples tested, at a level that 
is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 
about 10-fold higher than that for normal tissues. 

The present invention, in another aspect, provides polypeptide 
30 compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

In specific embodiments, the present invention provides polypeptide 
compositions comprising an amino acid sequence selected from the group consisting of 
sequences recited in SEQ ID NO: 391, 393, 395, 397, 421, 425-427, 434-439, 584-587, 
35 738, 739, 742 and 745. 
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In certain preferred embodiments, the polypeptides and/or 
polynucleotides of the present invention are immunogenic, i.e., they are capable of 
eliciting an immune response, particularly a humoral and/or cellular immune response, 
as further described herein. 
5 The present invention further provides fragments, variants and/or 

derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity 
of at least about 50%, preferably at least about 70% and more preferably at least about 
90% of the level of immunogenic activity of a polypeptide sequence set forth in SEQ ID 
10 NOs: 391, 393, 395, 397, 421, 425-427, 434-439, 584-587, 738, 739, 742, 745 and/or a 
polypeptide sequence encoded by a polynucleotide sequence set forth in SEQ ID NOs: 
217-390, 392, 394, 396, 398-420 422-424, 428-433 440-583, 588-732, 736, 737, 740, 
741, 744 and 746. 

The present invention further provides polynucleotides that encode a 
15 polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transformed or transfected with such expression vectors. 

Within other aspects, the present invention provides pharmaceutical 
compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 
20 Within a related aspect of the present invention, the pharmaceutical 

compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and an immunostimulant, such as an adjuvant. 

The present invention further provides pharmaceutical compositions that 
25 comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
30 described above and (b) a pharmaceutically acceptable carrier or excipient. Illustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 

Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
35 and (b) an immunostimulant. 
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The present invention further provides, in other aspects, fusion proteins 
that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
5 immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
10 stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
patient may be afflicted with lung cancer, in which case the methods provide treatment 
for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

15 Within further aspects, the present invention provides methods for 

inhibiting the development of a cancer in a patient, comprising administering to a 
patient a pharmaceutical composition as recited above. The patient may be afflicted 
with lung cancer, in which case the methods provide treatment for the disease, or patient 
considered at risk for such a disease may be treated prophylactically. 

20 The present invention further provides, within other aspects, methods for 

removing tumor cells from a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein from the sample. 

25 Within related aspects, methods are provided for inhibiting the 

development of a cancer in a patient, comprising administering to a patient a biological 
sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, comprising 

30 contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
prepared as described above are also provided. 
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Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
5 development of a cancer in a patient, comprising the steps of: (a) incubating CD4 + 
and/or CD8 + T cells isolated from a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
10 of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
patient. Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer, preferably a lung cancer, in a patient 

15 comprising: (a) contacting a biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount 
of polypeptide that binds to the binding agent; and (c) comparing the amount of 
polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 

20 is an antibody, more preferably a monoclonal antibody. 

The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient. Such methods comprise the steps 
of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 

25 sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

30 The present invention further provides, within other aspects, methods for 

determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b) 
detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to 

35 the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off value, and therefrom determining the 
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presence or absence of a cancer in the patient. Within certain embodiments, the amount 
of mRNA is detected via polymerase chain reaction using, for example, at least one 
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 
recited above, or a complement of such a polynucleotide. Within other embodiments, 
5 the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the progression 
of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 

10 obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological sample obtained from the patient at a subsequent point in time; and 
(d) comparing the amount of polynucleotide detected in step (c) with the amount 

15 detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

Within further aspects, the present invention provides antibodies, such as 
monoclonal antibodies, that bind to a polypeptide as described above, as well as 
diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
20 oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

25 

SEQUENCE IDENTIFIERS 

SEQ ID NO: 1 is the determined cDNA sequence for L363Cl.cons 
SEQ ID NO: 2 is the determined cDNA sequence for L263C2.cons 
SEQ ID NO: 3 is the determined cDNA sequence for L263C2c 

3 0 SEQ ID NO : 4 is the determined cDNA sequence for L263 C 1 .cons 

SEQ ID NO: 5 is the determined cDNA sequence for L263Clb 
SEQ ID NO: 6 is the determined cDNA sequence for L164C2.cons 
SEQ ID NO: 7 is the determined cDNA sequence for L164Cl.cons 
SEQ ID NO: 8 is the determined cDNA sequence for L366Cla 

35 SEQ ID NO: 9 is the determined cDNA sequence for L260C1 .cons 
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SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 



the determined cDNA sequence for L163Clc 
the determined cDNA sequence for L163Clb 
the determined cDNA sequence for L255Cl.cons 
the determined cDNA sequence for L255Clb 
the determined cDNA sequence for L355Cl.cons 
the determined cDNA sequence for L366Cl.cons 
the determined cDNA sequence for L163Cla 
the determined cDNA sequence for LT86-1 
the determined cDNA sequence for LT86-2 
the determined cDNA sequence for LT86-3 
the determined cDNA sequence for LT86-4 
, the determined cDNA sequence for LT86-5 
; the determined cDNA sequence for LT86-6 
the determined cDNA sequence for LT86-7 
the determined cDNA sequence for LT86-8 
; the determined cDNA sequence for LT86-9 
; the determined cDNA sequence for LT86-10 
i the determined cDNA sequence for LT86-1 1 
i the determined cDNA sequence for LT86-12 
i the determined cDNA sequence for LT86-13 
; the determined cDNA sequence for LT86-14 
i the determined cDNA sequence for LT86-15 
i the predicted amino acid sequence for LT86-1 
i the predicted amino acid sequence for LT86-2 
i the predicted amino acid sequence for LT86-3 
: the predicted amino acid sequence for LT86-4 
: the predicted amino acid sequence for LT86-5 
; the predicted amino acid sequence for LT86-6 
: the predicted amino acid sequence for LT86-7 
i the predicted amino acid sequence for LT86-8 
i the predicted amino acid sequence for LT86-9 
: the predicted amino acid sequence for LT86-10 
: the predicted amino acid sequence for LT86-1 1 
i the predicted amino acid sequence for LT86-12 
; the predicted amino acid sequence for LT86-13 
: the predicted amino acid sequence for LT86-14 
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SEQ ID NO: 46 
SEQIDNO:47 
SEQIDNO:48 
SEQIDNO: 49 
SEQIDNO: 50 
SEQIDNO: 51 
SEQIDNO: 52 
SEQIDNO: 53 
SEQ ID NO: 54 
SEQIDNO: 55 
SEQIDNO: 56 
SEQIDNO: 57 
SEQ ID NO: 58 
SEQIDNO: 59 
SEQ ID NO: 60 
SEQ ID NO: 61 
SEQ ID NO: 62 
SEQ ID NO: 63 
SEQ ID NO: 64 
SEQ ID NO: 65 



LT86-9 



the predicted amino acid sequence 
a (dT)i2AG primer 
a primer 

the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the predicted amino acid sequence 
the determined 5' cDNA sequence 
the determined 5' cDNA sequence 
i the predicted amino acid sequence 



forL86S-3 
for L86S-12 
for L86S-16 
forL86S-25 
forL86S-36 
for L86S-40 
forL86S-46 
forL86S-3 
forL86S-12 
forL86S-16 
for L86S-25 
forL86S-36 
forL86S-40 
for L86S-46 
for L86S-30 
for L86S-41 
from the 5' end of 



SEQ ID NO: 66 is the determined extended cDNA sequence for LT86-4 
SEQ ID NO: 67 is the predicted extended amino acid sequence for 



35 



SEQ ID NO 


68 


is 


the 


SEQ ID NO 


69 


is 


the 


SEQIDNO 


70 


is 


the 


SEQ ID NO 


71 


is 


the 


SEQIDNO 


72 


is 


the 


SEQIDNO 


73 


is 


the 


SEQIDNO 


74 


is 


the 


SEQIDNO- 


75 


is 


the 


SEQ ID NO 


76 


is 


the 


SEQIDNO: 


77 


is 


the 


SEQIDNO: 


78 


is 


the 


SEQIDNO: 


79 


is 


the 



determined 5' cDNA 
determined 3' cDNA 
determined 5' cDNA 
determined 5' cDNA 
determined 5' cDNA 
predicted amino acid 
predicted amino acid 
predicted amino acid 
predicted amino acid 
predicted amino acid 
determined extended 
determined extended 



sequence for LT86-20 
sequence for LT86-21 
sequence for LT86-22 
sequence for LT86-26 
sequence for LT86-27 
sequence for LT86-20 
sequence for LT86-21 
sequence for LT86-22 
sequence for LT86-26 
sequence for LT86-27 
cDNA sequence for L86S-12 
cDNA sequence for L86S-36 
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SEQ ID NO: 80 is the determined extended cDNA sequence for L86S-46 
SEQ ID NO: 81 is the predicted extended amino acid sequence for L86S- 

12 

SEQ ID NO: 82 is the predicted extended amino acid sequence for L86S- 

5 36 

SEQ ID NO: 83 is the predicted extended amino acid sequence for L86S- 

46 

SEQ ID NO: 84 is the determined 5'cDNA sequence for L86S-6 
SEQ ID NO: 85 is the determined 5'cDNA sequence for L86S-1 1 

10 SEQ ID NO: 86 is the determined 5'cDNA sequence for L86S-14 

SEQ ID NO: 87 is the determined 5'cDNA sequence for L86S-29 
SEQ ID NO: 88 is the determined 5'cDNA sequence for L86S-34 
SEQ ID NO: 89 is the determined 5'cDNA sequence for L86S-39 
SEQ ID NO: 90 is the determined 5'cDNA sequence for L86S-47 

1 5 SEQ ID NO: 91 is the determined 5'cDNA sequence for L86S-49 

SEQ ID NO: 92 is the determined 5'cDNA sequence for L86S-51 
SEQ ID NO: 93 is the predicted amino acid sequence for L86S-6 
SEQ ID NO: 94 is the predicted amino acid sequence for L86S-1 1 
SEQ ID NO: 95 is the predicted amino acid sequence for L86S-14 

20 SEQ ID NO: 96 is the predicted amino acid sequence for L86S-29 

SEQ ID NO: 97 is the predicted amino acid sequence for L86S-34 
SEQ ID NO: 98 is the predicted amino acid sequence for L86S-39 
SEQ ID NO: 99 is the predicted amino acid sequence for L86S-47 
SEQ ID NO: 100 is the predicted amino acid sequence for L86S-49 

25 SEQ ID NO: 101 is the predicted amino acid sequence for L86S-51 

SEQ ID NO: 102 is the determined DNA sequence for SLT-T1 
SEQ ID NO: 103 is the determined 5' cDNA sequence for SLT-T2 
SEQ ID NO: 104 is the determined 5' cDNA sequence for SLT-T3 
SEQ ID NO: 105 is the determined 5' cDNA sequence for SLT-T5 

3 0 SEQ ID NO : 1 06 is the determined 5 ' cDN A sequence for SLT-T7 

SEQ ID NO: 107 is the determined 5' cDNA sequence for SLT-T9 
SEQ ID NO: 108 is the determined 5' cDNA sequence for SLT-T10 
SEQ ID NO: 109 is the determined 5' cDNA sequence for SLT-T1 1 
SEQ ID NO: 1 10 is the determined 5' cDNA sequence for SLT-T12 

35 SEQ ID NO: 1 1 1 is the predicted amino acid sequence for SLT-T1 

SEQ ID NO: 1 12 is the predicted amino acid sequence for SLT-T2 
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30 



SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 



is the predicted amino acid sequence for SLT-T3 
is the predicted amino acid sequence for SLT-T10 
is the predicted amino acid sequence for SLT-T12 
e determined 5' cDNA sequence for SALT-T3 
s the determined 5' cDNA sequence for SALT-T4 
s the determined 5' cDNA sequence for SALT-T7 
s the determined 5' cDNA sequence for SALT-T8 
s the determined 5' cDNA sequence for SALT-T9 
s the predicted amino acid sequence for SALT-T3 
s the predicted amino acid sequence for SALT-T4 
s the predicted amino acid sequence for S ALT-T7 
s the predicted amino acid sequence for SALT-T8 
is the predicted amino acid sequence for SALT-T9 
ls the determined cDNA sequence for PSLT-1 
ls the determined cDNA sequence for PSLT-2 
is the determined cDNA sequence for PSLT-7 
is the determined cDNA sequence for PSLT-1 3 
is the determined cDNA sequence for PSLT-27 
is the determined cDNA sequence for PSLT-28 
is the determined cDNA sequence for PSLT-30 
s the determined cDNA sequence for PSLT-40 
is the determined cDNA sequence for PSLT-69 
is the determined cDNA sequence for PSLT-7 1 
is the determined cDNA sequence for PSLT-73 
is the determined cDNA sequence for PSLT-79 
is the determined cDNA sequence for PSLT-03 
s the determined cDNA sequence for PSLT-09 
is the determined cDNA sequence for PSLT-01 1 
s the determined cDNA sequence for PSLT-041 
s the determined cDNA sequence for PSLT-62 
s the determined cDNA sequence for PSLT-6 
s the determined cDNA sequence for PSLT-37 
is the determined cDNA sequence for PSLT-74 
s the determined cDNA sequence for PSLT-010 
is the determined cDNA sequence for PSLT-01 2 
is the detennined cDNA sequence for PSLT-03 7 
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SEQ ID NO: 149 is the determined 5' cDNA sequence for SAL-3 
SEQ ID NO: 150 is the determined 5' cDNA sequence for SAL-24 
SEQ ID NO: 151 is the determined 5' cDNA sequence for SAL-25 
SEQ ID NO: 152 is the determined 5' cDNA sequence for SAL-3 3 
SEQ ID NO: 153 is the determined 5' cDNA sequence for SAL-50 
SEQ ID NO: 154 is the determined 5' cDNA sequence for SAL-57 
SEQ ID NO: 155 is the determined 5' cDNA sequence for SAL-66 
SEQ ID NO: 156 is the determined 5' cDNA sequence for SAL-82 
SEQ ID NO: 157 is the determined 5' cDNA sequence for SAL-99 
SEQ ID NO: 158 is the determined 5' cDNA sequence for SAL-104 
SEQ ID NO: 159 is the determined 5' cDNA sequence for SAL-109 
SEQ ID NO: 160 is the determined 5' cDNA sequence for SAL-5 
SEQ ID NO: 161 is the determined 5' cDNA sequence for SAL-8 
SEQ ID NO: 162 is the determined 5' cDNA sequence for SAL- 12 
SEQ ID NO: 163 is the determined 5' cDNA sequence for SAL- 14 
SEQ ID NO: 164 is the determined 5' cDNA sequence for SAL- 16 
SEQ ID NO: 165 is the determined 5' cDNA sequence for SAL-23 
SEQ ID NO: 166 is the determined 5' cDNA sequence for SAL-26 
SEQ ID NO: 167 is the determined 5' cDNA sequence for SAL-29 
SEQ ID NO: 168 is the determined 5' cDNA sequence for SAL-32 
SEQ ID NO: 169 is the determined 5' cDNA sequence for SAL-3 9 
SEQ ID NO: 170 is the determined 5' cDNA sequence for SAL-42 
SEQ ID NO: 171 is the determined 5' cDNA sequence for SAL-43 
SEQ ID NO: 172 is the determined 5' cDNA sequence for SAL-44 
SEQ ID NO: 173 is the determined 5' cDNA sequence for SAL-48 
SEQ ID NO: 174 is the determined 5' cDNA sequence for SAL-68 
SEQ ID NO: 175 is the determined 5' cDNA sequence for SAL-72 
SEQ ID NO : 1 76 is the determined 5 ' cDN A sequence for S AL-77 
SEQ ID NO: 177 is the determined 5' cDNA sequence for SAL-86 
SEQ ID NO: 178 is the determined 5' cDNA sequence for SAL-88 
SEQ ID NO: 179 is the determined 5' cDNA sequence for SAL-93 
SEQ ID NO: 180 is the determined 5' cDNA sequence for SAL-100 
SEQ ID NO: 181 is the determined 5' cDNA sequence for SAL- 105 
SEQ ID NO: 182 is the predicted amino acid sequence for SAL-3 
SEQ ID NO: 183 is the predicted amino acid sequence for SAL-24 
SEQ ID NO: 184 is a first predicted amino acid sequence for SAL-25 
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30 



SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 



is a second predicted amino acid sequence for SAL-25 
is the predicted amino acid sequence for SAL-33 
is a first predicted amino acid sequence for SAL-50 
is the predicted amino acid sequence for SAL-57 
s a first predicted amino acid sequence for SAL-66 
is a second predicted amino acid sequence for SAL-66 
s the predicted amino acid sequence for SAL-82 
s the predicted amino acid sequence for SAL-99 
s the predicted amino acid sequence for SAL- 104 
s the predicted amino acid sequence for SAL-5 
s the predicted amino acid sequence for SAL- 8 
s the predicted amino acid sequence for SAL- 12 
s the predicted amino acid sequence for SAL- 14 
is the predicted amino acid sequence for SAL- 16 
s the predicted amino acid sequence for SAL-23 
is the predicted amino acid sequence for S AL-26 
is the predicted amino acid sequence for SAL-29 
is the predicted amino acid sequence for SAL-32 
is the predicted amino acid sequence for SAL-39 
is the predicted amino acid sequence for SAL-42 
is the predicted amino acid sequence for S AL-43 
is the predicted amino acid sequence for SAL-44 
s the predicted amino acid sequence for SAL-48 
is the predicted amino acid sequence for SAL-68 
is the predicted amino acid sequence for SAL-72 
is the predicted amino acid sequence for SAL- 77 
is the predicted amino acid sequence for SAL-86 
is the predicted amino acid sequence for SAL-88 
is the predicted amino acid sequence for SAL-93 
is the predicted amino acid sequence for SAL- 100 
is the predicted amino acid sequence for SAL- 105 
is a second predicted amino acid sequence for SAL-50 
is the determined cDNA sequence for SSLT-4 
is the determined cDNA sequence for SSLT-9 
is the determined cDNA sequence for SSLT-1 0 
is the determined cDNA sequence for SSLT-1 2 



12 



WO 02/092001 



PCT7US02/14975 



35 



SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 



is the determined cDNA sequence for SSLT-19 
is the determined cDNA sequence for SSLT-31 
is the determined cDNA sequence for SSLT-38 
is the determined cDNA sequence for LT4690-2 
is the determined cDNA sequence for LT4690-3 
is the determined cDNA sequence for LT4690-22 
is the determined cDNA sequence for LT4690-24 
s the determined cDNA sequence for LT4690-37 
s the determined cDNA sequence for LT4690-39 
s the determined cDNA sequence for LT4690-40 
s the determined cDNA sequence for LT4690-41 
s the determined cDNA sequence for LT4690-49 
s the determined 3' cDNA sequence for LT4690-55 
.s the determined 5' cDNA sequence for LT4690-55 
s the determined cDNA sequence for LT4690-59 
s the determined cDNA sequence for LT4690-63 
is the determined cDNA sequence for LT4690-71 
s the determined cDNA sequence for 2LT-3 
is the determined cDNA sequence for 2LT-6 
is the determined cDNA sequence for 2LT-22 
is the determined cDNA sequence for 2LT-25 
is the determined cDNA sequence for 2LT-26 
is the determined cDNA sequence for 2LT-3 1 
ls the determined cDNA sequence for 2LT-36 
is the determined cDNA sequence for 2LT-42 
is the determined cDNA sequence for 2LT-44 
is the determined cDNA sequence for 2LT-54 
is the determined cDNA sequence for 2LT-55 
is the determined cDNA sequence for 2LT-57 
is the determined cDNA sequence for 2LT-58 
is the determined cDNA sequence for 2LT-59 
s the determined cDNA sequence for 2LT-62 
is the determined cDNA sequence for 2LT-63 
is the determined cDNA sequence for 2LT-65 
is the deterrnined cDNA sequence for 2LT-66 
is the determined cDNA sequence for 2LT-70 
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SSLT-74 



SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 

SEQ 
SEQ 
SEQ 



ID NO: 257 is the determined cDNA sequence for 2LT-73 
ID NO: 258 is the determined cDNA sequence for 2LT-74 
ID NO: 259 is the determined cDNA sequence for 2LT-76 
ID NO: 260 is the determined cDNA sequence for 2LT-77 
ID NO: 261 is the determined cDNA sequence for 2LT-78 
ID NO: 262 is the determined cDNA sequence for 2LT-80 
ID NO: 263 is the determined cDNA sequence for 2LT-85 
ID NO: 264 is the determined cDNA sequence for 2LT-87 
ID NO: 265 is the determined cDNA sequence for 2LT-89 
ID NO: 266 is the determined cDNA sequence for 2LT-94 
ID NO: 267 is the determined cDNA sequence for 2LT-95 
ID NO: 268 is the determined cDNA sequence for 2LT-98 
ID NO: 269 is the determined cDNA sequence for 2LT-100 
ID NO: 270 is the determined cDNA sequence for 2LT-103 
ID NO: 271 is the determined cDNA sequence for 2LT-105 
ID NO: 272 is the determined cDNA sequence for 2LT-107 
ID NO: 273 is the determined cDNA sequence for 2LT-108 
ID NO: 274 is the determined cDNA sequence for 2LT-109 
ID NO: 275 is the determined cDNA sequence for 2LT-1 1 8 
ID NO: 276 is the determined cDNA sequence for 2LT-120 
ID NO: 277 is the determined cDNA sequence for 2LT-121 
ID NO: 278 is the determined cDNA sequence for 2LT-122 
ID NO: 279 is the determined cDNA sequence for 2LT-124 
ID NO: 280 is the determined cDNA sequence for 2LT-126 
ID NO: 281 is the determined cDNA sequence for 2LT-127 
ID NO: 282 is the determined cDNA sequence for 2LT-128 
ID NO: 283 is the determined cDNA sequence for 2LT-129 
ID NO: 284 is the determined cDNA sequence for 2LT-133 
ID NO: 285 is the determined cDNA sequence for 2LT-137 
ID NO: 286 is the determined cDNA sequence for LT4690-71 
ID NO: 287 is the determined cDNA sequence for LT4690-82 
ID NO: 288 is the determined full-length cDNA i 



for 



ID NO: 289 is the determined cDNA sequence for SSLT-78 
ID NO: 290 is the determined cDNA sequence for SCC1-8. 
ID NO: 291 is the determined cDNA sequence for SCC1-12. 
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SEQ ID NO: 292 is the determined cDNA sequence for SCC1-336 
SEQ ID NO: 293 is the determined cDNA sequence for SCC1-344 
SEQ ID NO: 294 is the determined cDNA sequence for SCC1-345 
SEQ ID NO: 295 is the determined cDNA sequence for SCC 1-346 
5 SEQ ID NO: 296 is the determined cDNA sequence for SCC1-348 

SEQ ID NO: 297 is the determined cDNA sequence for SCC1-350 
SEQ ID NO: 298 is the determined cDNA sequence for SCC1-352 
SEQ ID NO: 299 is the determined cDNA sequence for SCC 1-3 54 
SEQ ID NO: 300 is the determined cDNA sequence for SCC1-355 

10 SEQ ID NO: 301 is the determined cDNA sequence for SCC1-356 

SEQ ID NO: 302 is the determined cDNA sequence for SCC1-357 
SEQ ID NO: 303 is the determined cDNA sequence for SCC1-501 
SEQ ID NO: 304 is the determined cDNA sequence for SCC1-503 
SEQ ID NO: 305 is the determined cDNA sequence for SCC 1-5 13 

1 5 SEQ ID NO: 3 06 is the determined cDNA sequence for SCC 1-516 

SEQ ID NO: 307 is the determined cDNA sequence for SCC 1-5 18 
SEQ ID NO: 308 is the determined cDNA sequence for SCC1-519 
SEQ ID NO: 309 is the determined cDNA sequence for SCC1-522 
SEQ ID NO: 310 is the determined cDNA sequence for SCC 1-523 

20 SEQ ID NO: 3 1 1 is the determined cDNA sequence for SCC1 -525 

SEQ ID NO: 312 is the determined cDNA sequence for SCC1-527 
SEQ ID NO: 313 is the determined cDNA sequence for SCC 1-529 
SEQ ID NO: 314 is the determined cDNA sequence for SCC1-530 
SEQ ID NO: 315 is the determined cDNA sequence for SCC1-531 

25 SEQ ID NO: 3 16 is the determined cDNA sequence for SCC1-532 

SEQ ID NO: 317 is the determined cDNA sequence for SCC1-533 
SEQ ID NO: 318 is the determined cDNA sequence for SCC1-536 
SEQ ID NO: 3 19 is the determined cDNA sequence for SCC 1-53 8 
SEQ ID NO: 320 is the determined cDNA sequence for SCC1-539 

30 SEQ ID NO: 321 is the determined cDNA sequence for SCC1-541 

SEQ ID NO: 322 is the determined cDNA sequence for SCC1-542 
SEQ ID NO: 323 is the determined cDNA sequence for SCC 1-546 
SEQ ID NO: 324 is the determined cDNA sequence for SCC1-549 
SEQ ID NO: 325 is the determined cDNA sequence for SCC1-551 

35 SEQ ID NO: 326 is the determined cDNA sequence for SCC1-552 

SEQ ID NO: 327 is the determined cDNA sequence for SCC 1-554 
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35 



SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 



is the determined 
the determined 
s the determined 
s the determined 
s the determined 
s the determined 
.s the determined 
s the determined 
s the determined 
is the determined 
is the determined 
is the determined 
is the determined 
ls the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
is the determined 
ls the determined 
s the determined 
is the determined 
is the determined 

359 is the determined 

360 is the determined 

361 is the determined 

362 is the determined 

363 is the determined 



cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
. cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
cDNA sequence 
. cDNA sequence 



forSCCl-558 
for SCC1-559 
for SCC1-561 
for SCC1-562 
forSCCl-564 
for SCC1-565 
for SCC1-566 
for SCC 1-567 
for SCC1-568 
for SCC 1-570 
for SCC 1-572 
for SCC 1-575 
for SCC 1-576 
for SCC 1-577 
for SCC1-578 
for SCC1-582 
for SCC 1-583 
for SCC1-586 
for SCC 1-588 
for SCC1-590 
for SCC1-591 
for SCC 1-592 
for SCC 1-593 
for SCC1-594 
forSCCl-595 
for SCC 1-596 
for SCC 1-598 
for SCC 1-599 
for SCC1-602 
for SCC 1-604 
for SCC 1-605 
for SCC1-606 
for SCC 1-607 
for SCC 1-608 
for SCC1-610 
for clone DMS79T1 
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DMS79T10 
10 DMS79T11 



20 

NCIH69T3 
NCIH69T5 

25 

NCIH69T6 
NCIH69T7 
30 NCIH69T9 
NCIH69T10 
NCIH69T11 



35 



SEQ ID NO: 364 is the determined cDNA sequence for clone DMS79T2 
SEQ ID NO: 365 is the determined cDNA sequence for clone DMS79T3 
SEQ ID NO: 366 is the determined cDNA sequence for clone DMS79T5 
SEQ ID NO: 367 is the determined cDNA sequence for clone DMS79T6 
SEQ ID NO: 368 is the determined cDNA sequence for clone DMS79T7 
SEQ ID NO: 369 is the determined cDNA sequence for clone DMS79T9 
SEQ ID NO: 370 is the determined cDNA sequence for clone 

SEQ ID NO: 371 is the determined cDNA sequence for clone 

SEQ ID NO: 372 is the determined cDNA sequence for clone 128T1 
SEQ ID NO: 373 is the determined cDNA sequence for clone 128T2 
SEQ ID NO: 374 is the determined cDNA sequence for clone 128T3 
SEQ ID NO: 375 is the determined cDNA sequence for clone 128T4 
SEQ ID NO: 376 is the determined cDNA sequence for clone 128T5 
SEQ ID NO: 377 is the determined cDNA sequence for clone 128T7 
SEQ ID NO: 378 is the determined cDNA sequence for clone 128T9 
SEQ ID NO: 379 is the determined cDNA sequence for clone 128T10 
SEQ ID NO: 380 is the determined cDNA sequence for clone 128T1 1 
SEQ ID NO: 381 is the determined cDNA sequence for clone 128T12 
SEQ ID NO: 382 is the determined cDNA sequence for clone 

SEQ ID NO: 383 is the determined cDNA sequence for clone 

SEQ ID NO: 384 is the determined cDNA sequence for clone 

SEQ ID NO: 385 is the determined cDNA sequence for clone 

SEQ ID NO: 386 is the determined cDNA sequence for clone 

SEQ ID NO: 387 is the determined cDNA sequence for clone 

SEQ ID NO: 388 is the determined cDNA sequence for clone 

SEQ ID NO: 389 is the determined cDNA sequence for clone 



NCIH69T12 
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SEQ ID NO: 390 is the full-length cDNA sequence for 128T1 

SEQ ID NO: 391 is the amino acid sequence for 128T1 

SEQ ID NO: 392 is the full-length cDNA sequence for 2LT-128 

SEQ ID NO: 393 is the amino acid sequence for 2LT-128 

SEQ ID NO: 394 is an extended cDNA sequence for clone SCC 1-542 

SEQ ID NO: 395 is the amino acid sequence corresponding to SEQ ID 



SEQ 
SEQ 



ID NO: 396 is an extended cDNA sequence for clone SCC 1-5 93 
ID NO: 397 is the amino acid sequence corresponding to SEQ ID 



SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



IDNO:398 
IDNO:399 
IDNO:400 
IDNO:401 
IDNO:402 
IDNO:403 
ID NO:404 
IDNO:405 
IDNO:406 
ID NO:407 
IDNO:408 
IDNO:409 
IDNO:410 
IDNO:411 
IDNO:412 
IDNO:413 
ID NO:414 
IDNO:415 
IDNO:416 
IDNO:417 
IDNO:418 
IDNO:419 
ID NO:420 
IDNO:421 
IDNO:422 
IDNO:423 



s the determined cDNA sequence for 55508.1 
is the determined cDNA sequence for 55509.1 
is the determined cDNA sequence for 54243.1 
is the determined cDNA sequence for 54251.1 
is the determined cDNA sequence for 54252.1 
is the determined cDNA sequence for 54253.1 
is the determined cDNA sequence for 55518.1 
is the determined cDNA sequence for 54258.1 
is the determined cDNA sequence for 54575.1 
is the determined cDNA sequence for 54577.1 
is the determined cDNA sequence for 54584.1 
s the determined cDNA sequence for 55521.1 
is the determined cDNA sequence for 54589.1 
is the determined cDNA sequence for 54592.1 
is the determined cDNA sequence for 55134.1 
is the determined cDNA sequence for 55137.1 
the determined cDNA sequence for 55140.1 
s the determined cDNA sequence for 55531.1 
is the determined cDNA sequence for 55532.1 
s the determined cDNA sequence for 54621.1 
s the determined cDNA sequence for 55548.1 
is the determined cDNA sequence for 54623.1 
s the determined cDNA sequence for L39 
s the predicted amino acid sequence for L39 ' 
is the determined cDNA sequence for SCC2-29 
is the determined cDNA sequence for SCC2-36 
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SEQ ID NO:424 is the determined cDNA sequence for SCC2-60 
SEQ ID NO:425 is the predicted amino acid sequence for SCC2-29 
SEQ ID NO:426 is the predicted amino acid sequence for SCC2-36 
SEQ ID NO:427 is the predicted amino acid sequence for SCC2-60 
5 SEQ ID NO:428 is an extended cDNA sequence for the clone 20129, 

also referred to as 2LT-3, set forth in SEQ ID NO: 238 

SEQ ID NO:429 is an extended cDNA sequence for the clone 20347, 

also referred to as 2LT-26, set forth in SEQ ID NO: 242 

SEQ ID NO:430 is an extended cDNA sequence for the clone 21282, 
10 also referred to as 2LT-57, set forth in SEQ ID NO: 249 

SEQ ID NO:431 is an extended cDNA sequence for the clone 21283, 

also referred to as 2LT-58, set forth in SEQ ID NO: 250 

SEQ ID NO:432 is an extended cDNA sequence for the clone 21484, 

also referred to as 2LT-98, set forth in SEQ ID NO: 268 
15 SEQ ID NO:433 is an extended cDNA sequence for the clone 21871, 

also referred to as 2LT-124, set forth in SEQ ID NO: 279 

SEQ ID NO:434 is an amino acid sequence encoded by SEQ ID NO: 428 
SEQ ID NO:435 is an amino acid sequence encoded by SEQ ID NO: 429 
SEQ ID NO:436 is an amino acid sequence encoded by SEQ ID NO: 430 
20 SEQ ID NO:437 is an amino acid sequence encoded by SEQ ID NO: 43 1 

SEQ ID NO:438 is an amino acid sequence encoded by SEQ ID NO: 432 
SEQ ID NO:439 is an amino acid sequence encoded by SEQ ID NO: 433 
SEQ ID NO:440 is the determined cDNA sequence for clone 19A4 
SEQ ID NO: 441 is the determined full-length cDNA sequence for clone 

25 14F10. 

SEQ ID NO: 442 is the determined 5' cDNA sequence for clone 20E10. 
SEQ ID NO: 443 is a first determined cDNA sequence for clone 55153. 
SEQ ID NO: 444 is a second determined cDNA sequence for clone 



55154. 



55156. 



SEQ ID NO: 445 is a first determined cDNA sequence for clone 55154. 
SEQ ID NO: 446 is a second determined cDNA sequence for clone 

SEQ ID NO: 447 is the determined cDNA sequence for clone 55155. 
SEQ ID NO: 448 is a first determined cDNA sequence for clone 55156. 
SEQ ID NO: 449 is a second determined cDNA sequence for clone 
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55164. 



20 55165. 



SEQ ID NO: 450 is a first determined cDNA sequence for clone 55157. 
SEQ ID NO: 451 is a second determined cDNA sequence for clone 

SEQ ID NO: 452 is the determined cDNA sequence for clone 55158. 
SEQ ID NO: 453 is the determined cDNA sequence for clone 55159. 
SEQ ID NO: 454 is a first determined cDNA sequence for clone 55161. 
SEQ ID NO: 455 is a second determined cDNA sequence for clone 

SEQ ID NO: 456 is a first determined cDNA sequence for clone 55162. 
SEQ ID NO: 457 is a second determined cDNA sequence for clone 

SEQ ID NO: 458 is a first determined cDNA sequence for clone 55163. 
SEQ ID NO: 459 is a second determined cDNA sequence for clone 

SEQ ID NO: 460 is a first determined cDNA sequence for clone 55164. 
SEQ ID NO: 461 is a second determined cDNA sequence for clone 

SEQ ID NO: 462 is a first determined cDNA sequence for clone 55165. 
SEQ ID NO: 463 is a second determined cDNA sequence for clone 

SEQ ID NO: 464 is a first determined cDNA sequence for clone 55166. 
SEQ ID NO: 465 is a second determined cDNA sequence for clone 

SEQ ID NO: 466 is a first determined cDNA sequence for clone 55167. 
SEQ ID NO: 467 is a second determined cDNA sequence for clone 

SEQ ID NO: 468 is a first determined cDNA sequence for clone 55168. 
SEQ ID NO: 469 is a second determined cDNA sequence for clone 

SEQ ID NO: 470 is a first determined cDNA sequence for clone 55169. 
SEQ ID NO: 471 is a second determined cDNA sequence for clone 



SEQ ID NO: 472 is a first determined cDNA sequence for clone 55170. 
SEQ ID NO: 473 is a second determined cDNA sequence for clone 



SEQ ID NO: 474 is the determined cDNA sequence for clone 55171. 
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SEQ ID NO: 475 is the determined cDNA sequence for clone 55172. 
SEQ ID NO: 476 is the determined cDNA sequence for clone 55173. 
SEQ ID NO: 477 is a first determined cDNA sequence for clone 55174. 
SEQ ID NO: 478 is a second determined cDNA sequence for clone 

5 55174. 

SEQ ID NO: 479 is the determined cDNA sequence for clone 55175. 

SEQ ID NO: 480 is the determined cDNA sequence for clone 55176. 

SEQ ID NO: 481 is the determined cDNA sequence for contig 525. 

SEQ ID NO: 482 is the determined cDNA sequence for contig 526. 
10 SEQ ID NO: 483 is the determined cDNA sequence for contig 527. 

SEQ ID NO: 484 is the determined cDNA sequence for contig 528. 

SEQ ID NO: 485 is the determined cDNA sequence for contig 529. 

SEQ ID NO: 486 is the determined cDNA sequence for contig 530. 

SEQ ID NO: 487 is the determined cDNA sequence for contig 53 1 . 
1 5 SEQ ID NO: 488 is the determined cDNA sequence for contig 532. 

SEQ ID NO: 489 is the determined cDNA sequence for contig 533. 

SEQ ID NO: 490 is the determined cDNA sequence for contig 534. 

SEQ ID NO: 491 is the determined cDNA sequence for contig 535. 

SEQ ID NO: 492 is the determined cDNA sequence for contig 536. 
20 SEQ ID NO: 493 is the determined cDNA sequence for contig 537. 

SEQ ID NO: 494 is the determined cDNA sequence for contig 538. 

SEQ ID NO: 495 is the determined cDNA sequence for contig 539. 

SEQ ID NO: 496 is the determined cDNA sequence for contig 540. 

SEQ ID NO: 497 is the determined cDNA sequence for contig 541. 
25 SEQ ID NO: 498 is the determined cDNA sequence for contig 542. 

SEQ ID NO: 499 is the determined cDNA sequence for contig 543. 

SEQ ID NO: 500 is the determined cDNA sequence for contig 544: 

SEQ ID NO: 501 is the determined cDNA sequence for contig 545. 

SEQ ID NO: 502 is the determined cDNA sequence for contig 546. 
30 SEQ ID NO: 503 is the determined cDNA sequence for contig 547. 

SEQ ID NO: 504 is the determined cDNA sequence for contig 548. 

SEQ ID NO: 505 is the determined cDNA sequence for contig 549. 

SEQ ID NO: 506 is the determined cDNA sequence for contig 550. 

SEQ ID NO: 507 is the determined cDNA sequence for contig 551. 
35 SEQ ID NO: 508 is the detennined cDNA sequence for contig 552. 

SEQ ID NO: 509 is the determined cDNA sequence for contig 553. 
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CTJA TTV XTA. 

oiiQ ii-> JNU. 


510 


is the 




CPA TT"-\ \TA. 

bbQ iv JNU: 


511 


is the 




CCA TT\ XTA. 

br-Q IJD JNU. 


512 


is the 




bJbQ ID JNU: 


513 


is the 


5 


CT?A TT~\ \TA. 

bJbQ ID JNU: 


514 


is the 




SEQ ID NO 


515 


is the 




SEQ ID NO: 


516 


is the 




biiQ ID JNU 


517 


is the 




SEQ ID NO 


518 


is the 


10 


H17A TT\ \TA 

SEQ ID NO 


519 


is the 




SEQ ID NO" 


520 


is the 




SEQ ID NO 


521 


is the 




SEQ ID NO 


522 


is the 




OCA TTV XTA 

bilQ ID JNU 


523 


is the 


15 


SEQ ID NO 


524 


is the 




SEQ ID NO 


525 


is the 




SEQ ID NO 


526 


is the 




O T7 A TT\ XTA 

bbQ ID NU 


527 


is the 




nnA T~P\ XTA 

bbQ ID JNU 


528 


is the 


20 


SEQ ID NO 


529 


is the 




SEQ ID NO 


530 


is the 




SEQ ID NO: 


531 


is the 




SEQ ID NO 


532 


is the 




SEQ ID NO: 


533 


is the 


25 


SEQ ID NO 


534 


is the 




CCA T"H\ XTA. 

bJbQ LU JNU: 


535 


is the 




SEQ ID NO: 


536 


is the 




OT7A T"P\ XTA. 

oJbQ ID JNU: 


537 


is the 




CTJA TT\ XTA. 

bJbQ ID JNU: 


538 


is the 


30 


CTJA TT» XTA. 

oJbU^ 1JJ JNU: 


539 


is the 




OT7A T"PV XTA. 

oJiQ ID JNU: 


540 


is the 




SEQ ID NO: 


541 


is the 




SEQ ID NO: 


542 


is the 




SEQ ID NO: 


543 


is the 


35 


SEQ ID NO: 544 


is the 




SEQ ID NO: 545 


is the 



determined cDNA sequence for contig 554. 
determined cDNA sequence for contig 555. 
determined cDNA sequence for clone 57207. 
determined cDNA sequence for clone 57209. 
determined cDNA sequence for clone 57210. 
determined cDNA sequence for clone 5721 1. 
determined cDNA sequence for clone 57212. 
determined cDNA sequence for clone 57213. 
determined cDNA sequence for clone 57215. 
determined cDNA sequence for clone 57219. 
determined cDNA sequence for clone 57221. 
determined cDNA sequence for clone 57222. 
determined cDNA sequence for clone 57223. 
determined cDNA sequence for clone 57225. 
determined cDNA sequence for clone 57227. 
determined cDNA sequence for clone 57228. 
determined cDNA sequence for clone 57229. 
determined cDNA sequence for clone 57230. 
determined cDNA sequence for clone 57231. 
determined cDNA sequence for clone 57232. 
determined cDNA sequence for clone 57233. 
determined cDNA sequence for clone 57234. 
determined cDNA sequence for clone 57235. 
determined cDNA sequence for clone 57236. 
determined cDNA sequence for clone 57237. 
determined cDNA sequence for clone 57238. 
determined cDNA sequence for clone 57239. 
determined cDNA sequence for clone 57240. 
determined cDNA sequence for clone 57242. 
determined cDNA sequence for clone 57243. 
determined cDNA sequence for clone 57245. 
determined cDNA sequence for clone 57248. 
determined cDNA sequence for clone 57249. 
determined cDNA sequence for clone 57250. 
determined cDNA sequence for clone 57251. 
determined cDNA sequence for clone 57253. 
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SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQIDNO: 
SEQ ID NO: 
SEQIDNO: 
SEQIDNO: 



isthe 
is the 
is the 
is the 
isthe 
is the 
is the 
sthe 
is the 
is the 
sthe 
isthe 
isthe 
s the 
is the 
isthe 
s the 
is the 
s the 
s the 
s the 
is the 
isthe 
is the 
isthe 
is the 
isthe 
isthe 
isthe 
.s the 
sthe 
s the 
the 
s the 
sthe 
sthe 



determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 
determined cDNA 



sequence for clone 57254. 
sequence for clone 57255. 
sequence for clone 57257. 
sequence for clone 57258. 
sequence for clone 57259. 
sequence for clone 57261. 
sequence for clone 57262. 
sequence for clone 57263. 
sequence for clone 57264. 
sequence for clone 57265. 
sequence for clone 57266. 
sequence for clone 57267. 
sequence for clone 57268. 
sequence for clone 57269. 
sequence for clone 57270. 
sequence for clone 57271. 
sequence for clone 57272. 
sequence for clone 57274. 
sequence for clone 57275. 
sequence for clone 57277. 
sequence for clone 57280. 
sequence for clone 57281. 
sequence for clone 57282. 
sequence for clone 57283. 
sequence for clone 57285. 
sequence for clone 57287. 
sequence for clone 57288. 
sequence for clone 57289. 
sequence for clone 57290. 
sequence for clone 57292. 
sequence for clone 57295. 
sequence for clone 57296. 
sequence for clone 57297. 
sequence for clone 57299. 
sequence for clone 57301. 
sequence for clone 57302. 
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SEQ ID NO: 582 is the determined cDNA sequence for the beta chain of 
a lung tumor specific T cell receptor. 

SEQ ID NO: 583 is the determined cDNA sequence for the alpha chain 
of a lung tumor specific T cell receptor. 
5 SEQ ID NO: 584 is the amino acid sequence encoded by SEQ ID NO: 

583. 

SEQ ID NO: 585 is the amino acid sequence encoded by SEQ ID NO: 

582. 

SEQ ID NO: 586 is the amino acid sequence encoded by the 5' terminus 

10 ofl4F10. 

SEQ ID NO: 587 is the amino acid sequence of a T cell epitope 
contained within SEQ ID NO: 586. 

SEQ ID NO:588 is the determined cDNA sequence for 54533 
SEQ ID NO:589 is the determined cDNA sequence for 54534 

1 5 SEQ ID NO:590 is the determined cDNA sequence for 54536 

SEQ ID NO:591 is the determined cDNA sequence for 54538 
SEQ ID NO:592 is the determined cDNA sequence for 54540 
SEQ ID NO:593 is the determined cDNA sequence for 55084 
SEQ ID NO:594 is the determined cDNA sequence for 55086 

20 SEQ ID NO:595 is the determined cDNA sequence for 54555 

SEQ ID NO:596 is the determined cDNA sequence for 54557 
SEQ ID NO:597 is the determined cDNA sequence for 54564 
SEQ ID NO:598 is the determined cDNA sequence for 55098 
SEQ ID NO:599 is the determined cDNA sequence for 55473 

25 SEQ ID NO:600 is the determined cDNA sequence for 55 104 

SEQ ID NO:601 is the determined cDNA sequence for 55105 
SEQ ID NO:602 is the determined cDNA sequence for 55107 
SEQ ID NO:603 is the determined cDNA sequence for 55108 
SEQ ID NO:604 is the determined cDNA sequence for 551 14 

30 SEQ ID NO:605 is the determined cDNA sequence for 55477 

SEQ ID NO:606 is the determined cDNA sequence for 55482 
SEQ ID NO:607 is the determined cDNA sequence for 55483 
SEQ ID NO:608 is the determined cDNA sequence for 55485 
SEQ ID NO:609 is the determined cDNA sequence for 55487 

35 SEQ ID NO:610 is the determined cDNA sequence for 55488 

SEQ ID NO:61 1 is the determined cDNA sequence for 55087 
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SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQ ID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 
SEQID 



NO:612 
NO:613 
NO:614 
NO:615 
NO:616 
NO:617 
NO:618 
NO:619 
NO:620 
NO:621 
NO:622 
NO:623 
NO:624 
NO:625 
NO:626 
NO:627 
NO:628 
NO:629 
NO:630 
NO:631 
NO:632 
NO:633 
NO:634 
NO:635 
NO:636 
NO:637 
NO:638 
NO:639 
NO:640 
NO:641 
NO:642 
NO:643 
NO:644 
NO:645 
NO:646 
NO:647 



is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
s the determined cDNA 
,s the determined cDNA 
s the determined cDNA 
s the determined cDNA 
s the determined cDNA 
s the determined cDNA 
s the determined cDNA 
is the determined cDNA 
s the determined cDNA 
is the determined cDNA 
is the determined cDNA 
ls the determined cDNA 
s the determined cDNA 
is the determined cDNA 
s the determined cDNA 
is the determined cDNA 
s the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
is the determined cDNA 
.s the determined cDNA 

the determined cDNA 
.s the determined cDNA 

the determined cDNA 
s the determined cDNA 
s the determined cDNA 
s the determined cDNA 



sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 
sequence 



for 55089 
for 55092 
for 55093 
for 56926 
for 56930 
for 56944 
for 56945 
for 55490 
for 55495 
for 55504 
for 55506 
for 56480 
for 56482 
for 56484 
for 56487 
for 56488 
for 56490 
for 56493 
for 56494 
for 56495 
for 56499 
for 56517 
for 56952 
for 56953 
for 56959 
for 57139 
for 57078 
for 57092 
for 57099 
for 57100 
for 57105 
for 57111 
for 57117 
for 57121 
for 57124 
for 57125 
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SEQ ID NO:648-686 are the determined cDNA sequences for the clones 
described in Tables 9-10. 

SEQ ID NO:687-727 are the determined cDNA sequences for the clones 
described in Tables 11-13. 
5 SEQ ID NO:728 is the determined full-length cDNA sequence for clone 

DMS39 (partial sequence given in SEQ ID NO:695). 

SEQ ID NO:729 is the determined full-length cDNA sequence for clone 
DMS126 (partial sequence given in SEQ ID NO:708). 

SEQ ID NO-.730 is the determined full-length cDNA sequence for clone 
10 DMS218 (partial sequence given in SEQ ID NO:720). 

SEQ ID NO:731 is the determined full-length cDNA sequence for clone 
DMS253 (partial sequence given in SEQ ID NO:723). 

SEQ ID NO:732 is the determined full-length cDNA sequence for clone 
LSCC-86 (partial sequence given in SEQ ID NO:665). 
15 SEQ ID NO:733 is a first amino acid sequence encoded by SEQ ID 

NO:732 and designated LSCC-86proteinl. 

SEQ ID NO:734 is a second amino acid sequence encoded by SEQ ID 
NO:732 and designated LSCC-86protein2. 

SEQ ID NO:735 is a third amino acid sequence encoded by SEQ ID 
20 NO:732 and designated LSCC-86protein3. 

SEQ ID NO:736 is the determined full-length nucleic acid sequence of 
the cDNA insert contained in clone L86S-47. 

SEQ ID NO:737 is the determined full-length nucleic acid sequence of 
the cDNA insert contained in clone L86S-39. 
25 SEQ ID NO: 73 8 is the predicted amino acid sequence corresponding to 

an open reading frame contained in SEQ ID NO:736. 

SEQ ID NO: 73 9 is the predicted amino acid sequence corresponding to 
an open reading frame contained in SEQ ID NO: 73 7. 

SEQ ID NO:740 is the determined nucleic acid sequence of a composite 
30 DNA clone containing an extended sequence related to SEQ ID NO:440. 

SEQ ID NO: 741 is the determined nucleic acid sequence of an open 
reading frame contained in SEQ ID NO: 740, identified by anchored PGR cloning, 
encoding the polypeptide designated L200T. 

SEQ ID NO:742 is the predicted amino acid sequence of the polypeptide 
35 designated L200T encoded by the open reading frame set forth in SEQ ID NO:741 . 
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SEQ ID NO:743 is the nucleic acid sequence of the cysteine/glutamate 
transporter corresponding to GenSeqNo. Z16609. 

SEQ ID NO:744 are the 47 additional nucleotides 5' of the first 
nucleotide of SEQ ID NO:440 5 identified by anchored PCR cloning. 
5 SEQ ID NO: 745 are the additional 16 amino acids encoded by the 

extended open reading frame of SEQ ID NO:741. 

SEQ ID NO:746 is the nucleic acid sequence of human chromosome 4 
corresponding to GenBank Accession number ACO93903. 

SEQ ID NO:747 is a nucleic acid sequence derived from SEQ ID 
1 0 NO:746 containing exon 1 , intron 1 and exon 2 of L200T. 

SEQ ID NO:748 is the oligonucleotide sequence of the forward PCR 
primer used to prepare a vector for recombinant L200T fusion protein expression in E. 
coli host cells. 

SEQ ID NO:749 is the oligonucleotide sequence of the reverse PCR 
15 primer used to prepare a vector for recombinant L200T fusion protein expression in E. 
coli host cells. 

DETAILED DESCRIPTION OF THE INVENTION 

U.S. patents, U.S. patent application publications, U.S. patent 
applications, foreign patents, foreign patent applications and non-patent publications 

20 referred to in this specification and/or listed in the Application Data Sheet, are 
incorporated herein by reference, in their entirety. 

The present invention is directed generally to compositions and their use 
in the therapy and diagnosis of cancer, particularly lung cancer. As described further 
below, illustrative compositions of the present invention include, but are not restricted 

25 to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells (e.g., T cells). 

The practice of the present invention will employ, unless indicated 
specifically to the contrary, conventional methods of virology, immunology, 

30 microbiology, molecular biology and recombinant DNA techniques within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained fully in the literature. See, e.g., Sambrook, et al. Molecular 
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 

35 Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
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Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether 
5 supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms 
"a," "an" and "the" include plural references unless the content clearly dictates 
otherwise. 

Polypeptide Compositions 

10 As used herein, the term "polypeptide" " is used in its conventional 

meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 

15 expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, i.e., antigenic 

20 determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NOs: 217- 
390, 392, 394, 396, 398-420 422-424, 428-433, 440-583 and 588-732, or a sequence 

25 that hybridizes under moderately stringent conditions, or, alternatively, under highly 
stringent conditions, to a polynucleotide sequence set forth in any one of SEQ ID NOs: 
217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583 and 588-732. Certain 
other illustrative polypeptides of the invention comprise amino acid sequences as set 
forth in any one of SEQ ID NOs: 391, 393, 395, 397, 421, 425-427, 434-439, 584-587 

30 and. 

The polypeptides of the present invention are sometimes herein referred 
to as lung tumor proteins or lung tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
lung tumor samples. Thus, a "lung tumor polypeptide" or "lung tumor protein," refers 
35 generally to a polypeptide sequence of the present invention, or a polynucleotide 
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sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
lung tumor samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more of lung 
tumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
5 greater than the level of expression in normal tissues, as determined using a 
representative assay provided herein. A lung tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target, as further described 
below. 

10 In certain preferred embodiments, the polypeptides of the invention are 

immunogenic, i.e., they react detectably within an immunoassay (such as an ELISA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with lung cancer. 
Screening for immunogenic activity can be performed using techniques well known to 
the skilled artisan. For example, such screens can be performed using methods such as 

15 those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding of 
antibodies within the sera to the immobilized polypeptide. Unbound sera may then be 
removed and bound antibodies detected using, for example, 125 I-labeled Protein A. 

20 As would be recognized by the skilled artisan, immunogenic portions of 

the polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive (i.e., specifically binds) with the 
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 

25 Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 
1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 

30 specifically bind to an antigen (i.e., they react with the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 
antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
of the present invention is a portion that reacts with antisera and/or T-cells at a level that 

35 is not substantially less than the reactivity of the full-length polypeptide (e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 
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the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
5 polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. 

In certain other embodiments, illustrative immunogenic portions may 
include peptides in which an N-terminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative immunogenic portions will contain a small N- 

10 and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), 
relative to the mature protein. 

In another embodiment, a polypeptide composition of the invention may 
also comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 

15 polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
fragment or variant thereof. 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
that are immunologically reactive with one or more polypeptides described herein, or 

20 one or more polypeptides encoded by contiguous nucleic acid sequences contained in 
the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof, or to one or more nucleic acid sequences which hybridize to one or more of 
these sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide fragments 

25 comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 
including all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NOs: 391, 393, 395, 397, 421, 425-427, 434-439, 584-587, 
738, 739, 742, 745 and/or those encoded by a polynucleotide sequence set forth in a 
sequence of SEQ ID NOs: 217-390, 392, 394, 396, 398-420 422-424, 428-433, 440- 

30 583, 588-732, 736, 737, 740, 741, 744 and 746. 

In another aspect, the present invention provides variants of the 
polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 
80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity 

35 (determined as described below), along its length, to a polypeptide sequences set forth 
herein. 
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In one preferred embodiment, the polypeptide fragments and variants 
provide by the present invention are immunologically reactive with an antibody and/or 
T-cell that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
5 provided by the present invention exhibit a level of immunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
herein.A polypeptide "variant," as the term is used herein, is a polypeptide that typically 
differs from a polypeptide specifically disclosed herein in one or more substitutions, 

10 deletions, additions and/or insertions. Such variants may be naturally occurring or may 
be synthetically generated, for example, by modifying one or more of the above 
polypeptide sequences of the invention and evaluating their immunogenic activity as 
described herein and/or using any of a number of techniques well known in the art. 

For example, certain illustrative variants of the polypeptides of the 

15 invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 
acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions. A 

20 "conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 

25 and still obtain a functional molecule that encodes a variant or derivative polypeptide 
with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immunogenic variant or portion of a polypeptide of the invention, 
one skilled in the art will typically change one or more of the codons of the encoding 

30 DNA sequence according to Table 1 . 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 

35 defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
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coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 

5 

Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


uuc 


uuu 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


ccc 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
10 interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
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resultant protein, which in turn defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
5 isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other 

10 amino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, i.e. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
within +2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 

15 of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values 

20 have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 
2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 

25 substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within ±2 
is preferred, those within ±1 are particularly preferred, and those within +0.5 are even 
more particularly preferred. 

30 As outlined above, amino acid substitutions are generally therefore based 

on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 
of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 

35 threonine; glutamine and asparagine; and valine, leucine and isoleucine. 
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In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
5 nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

Amino acid substitutions may further be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 

10 nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 
include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 

15 represent conservative changes include: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or alternatively, contain nonconservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 

20 alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 

25 directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 
example, a polypeptide may be conjugated to an immunoglobulin Fc region. 

When comparing polypeptide sequences, two sequences are said to be 

30 "identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 

35 usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
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reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
5 Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 

10 Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 77:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 

15 Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc. Natl. Acad., Set USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 

20 Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

25 One preferred example of algorithms that are suitable for determining 

percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 

30 sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 

35 achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 
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reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. 

In one preferred approach, the "percentage of sequence identity" is 
determined by comparing two optimally aligned sequences over a window of 
5 comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 
the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 

10 which the identical amino acid residue occurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence (i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 

15 polypeptide that comprises multiple polypeptides as described herein, or that comprises 
at least one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 

20 native recombinant protein. Certain preferred fusion partners are both immunological 
and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

25 Fusion polypeptides may generally be prepared using standard 

techniques, including chemical conjugation. Preferably, a fusion polypeptide, is 
expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 

30 appropriate expression vector. The 3* end of the DNA sequence encoding one 
polypeptide component is ligated, with or without a peptide linker, to the 5' end of a 
DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

35 A peptide linker sequence may be employed to separate the first and 

second polypeptide components by a distance sufficient to ensure that each polypeptide 
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folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
5 secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 

, used in the linker sequence. Amino acid sequences which may be usefully employed as 

10 linkers include those disclosed in Maratea etal., Gene 40:39-46, 1985; Murphy etal, 
Proc. Natl. Acad. Sci. USA 55:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 

15 separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 

20 transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 

25 tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 
336:86-91, 1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 

30 and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 

35 and avirulent strains of M. tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 
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60/158,585; see also, Skeiky et al, Infection and Jmmun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 
purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous 
5 immunogenic polypeptides with which it is fused. One preferred Ral2 fusion 
polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid 
residues 192 to 323 of MTB32A. Other preferred Ral2 polynucleotides generally 
comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at least 
about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or at 

10 least about 300 nucleotides that encode a portion of a Ral2 polypeptide. Ral2 
polynucleotides may comprise a native sequence (i.e., an endogenous sequence that 
encodes a Ral2 polypeptide or a portion thereof) or may comprise a variant of such a 
sequence. Ral2 polynucleotide variants may contain one or more substitutions, 
additions, deletions and/or insertions such that the biological activity of the encoded 

15 fusion polypeptide is not substantially diminished, relative to a fusion polypeptide 
comprising a native Ral2 polypeptide. Variants preferably exhibit at least about 70% 
identity, more preferably at least about 80% identity and most preferably at least about 
90% identity to a polynucleotide sequence that encodes a native Ral2 polypeptide or a 
portion thereof. 

20 Within other preferred embodiments, an immunological fusion partner is 

derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 

25 embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NS1 

30 (hemaglutinin). Typically, the N-terrninal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the protein 
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is 
derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine 

35 amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
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backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of E. coli C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
5 fragment at the amino terminus has been described {see Biotechnology 70:795-798, 
1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 

10 the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4 + T-cells specific 

15 for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 
known synthetic and/or recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino acids can be generated by synthetic means, using techniques well known to 

20 those of ordinary skill in the art. In one illustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 

25 suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment. For example, a naturally-occurring protein or polypeptide is 

30 isolated if it is separated from some or all of the coexisting materials in the natural 
system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 
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Polynucleotide Compositions 

The present invention, in other aspects, provides polynucleotide 
compositions. The terms "DNA" and "polynucleotide" are used essentially 
interchangeably herein to refer to a DNA molecule that has been isolated free of total 
5 genomic DNA of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
molecule does not contain large portions of unrelated coding DNA, such as large 
chromosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 

10 genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 
plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 

15 be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 
DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 

20 to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 

25 sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably and 
immunogenic variant or derivative, of such a sequence. 

Therefore, according to another aspect of the present invention, 
polynucleotide compositions are provided that comprise some or all of a polynucleotide 

30 sequence set forth in any one of SEQ ID NOs: 217-390, 392, 394, 396, 398-420 422- 
424, 428-433, 440-583, 588-732, 736, 737, 740, 741, 744 and 746, complements of a 
polynucleotide sequence set forth in any one of SEQ ID NOs: 217-390, 392, 394, 396, 
398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 741, 744 and 746, and 
degenerate variants of a polynucleotide sequence set forth in any one of SEQ ID NOs: 

35 217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 736, 737, 740, 
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741, 744 and 746. In certain preferred embodiments, the polynucleotide sequences set 
forth herein encode immunogenic polypeptides, as described above. 

In other related embodiments, the present invention provides 
polynucleotide variants having substantial identity to the sequences disclosed herein in 
5 SEQ ID NOs: 217-390, 392, 394, 396, 398-420 422-424, 428-433, 440-583, 588-732, 
736, 737, 740, 741, 744 and 746, for example those comprising at least 70% sequence 
identity, preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or 
higher, sequence identity compared to a polynucleotide sequence of this invention using 
the methods described herein, (e.g., BLAST analysis using standard parameters, as 

10 described below). One skilled in this art will recognize that these values can be 
appropriately adjusted to determine corresponding identity of proteins encoded by two 
nucleotide sequences by taking into account codon degeneracy, amino acid similarity, 
reading frame positioning and the like. 

Typically, polynucleotide variants will contain one or more substitutions, 

15 additions, deletions and/or insertions, preferably such that the immunogenicity of the 
polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term "variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 

20 In additional embodiments, the present invention provides 

polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 

25 contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the 

30 like. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 
35 biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
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include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 
hybridizing at 50°C-60°C, 5 X SSC, overnight; followed by washing twice at 65°C for 
20 minutes with each of 2X 3 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 
the art will understand that the stringency of hybridization can be readily manipulated, 
5 such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 
highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.g., to 60-65°C or 65- 
70°C. 

10 In certain preferred embodiments, the polynucleotides described above, 

e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides 
that are immunologically cross-reactive with a polypeptide sequence specifically set 
forth herein. In other preferred embodiments, such polynucleotides encode 
polypeptides that have a level of immunogenic activity of at least about 50%, preferably 

15 at least about 70%, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herein. 

The polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other DNA 
sequences, such as promoters, polyadenylation signals, additional restriction enzyme 

20 sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid fragment 
of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 
example, illustrative polynucleotide segments with total lengths of about 10,000, about 

25 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intermediate lengths) are contemplated 
to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 
"identical" if the sequence of nucleotides in the two sequences is the same when aligned 

30 for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 

35 reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 
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Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
5 model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 

10 CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730. 

15 Alternatively, optimal alignment of sequences for comparison may be 

conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 

20 algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 

25 algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides of the invention. Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 

30 Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 
by the quantity X from its maximum achieved value; the cumulative score goes to zero 

35 or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
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determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. 
Acad. Set USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
5 a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 

10 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical 
nucleic acid bases occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the 

15 reference sequence (i.e., the window size) and multiplying the results by 100 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 
of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear minimal 

20 homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 

25 and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 
approach, such as site-specific mutagenesis, is employed for the preparation of 

30 immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 

35 nucleotide sequence changes into the polynucleotide. 
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Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 
5 both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 

10 contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 

15 molecule. In such embodiments, a primer comprising typically about 14 to about 25 
nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 

20 stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

25 In general, site-directed mutagenesis in accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 

30 vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E. coli cells, and clones are selected 

35 which include recombinant vectors bearing the mutated sequence arrangement. 
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The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
5 obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et a!., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et al, 1982, each incorporated herein by reference, for that purpose. 

10 As used herein, the term "oligonucleotide directed mutagenesis 

procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification. As used herein, the term "oligonucleotide directed 

15 mutagenesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 

20 vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 

25 present invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

In other embodiments of the present invention, the polynucleotide 

30 sequences provided herein can be advantageously used as probes or primers for nucleic 
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer contiguous identical or 

35 complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 
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(including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
5 complementary sequences in a given sample. However, other uses are also envisioned, 
such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 

10 or so (including intermediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 

15 complementary stretch(es), will ultimately depend on the intended use or application of 
the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 

20 complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 

25 hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
30 sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
35 total sequence. 
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Small polynucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 
5 technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 

10 selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will typically 
desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 

15 e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
little, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

20 Of course, for some applications, for example, where one desires to 

prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 

25 salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 

30 hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 

35 inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
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the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalactauronase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
5 Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, 
striatal GABAa receptor and human EGF (Jaskulski et al, Science. 1988 Jun 
10;240(4858): 1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al, Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 

10 5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
Patent 5,591,317 and U. S. Patent 5,783,683). 

Therefore, in certain embodiments, the present invention provides 

15 oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs 

20 comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 

25 Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, T m , 
binding energy, and relative stability. Antisense compositions may be selected based 
upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 

30 preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 
considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et al, Nucleic Acids 

35 Res. 1997, 25(17):3389-402). 
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The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et ah, 
5 Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
Further, the interaction with MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasma membrane. 

10 According to another embodiment of the invention, the polynucleotide 

compositions described herein are used in the design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 

15 possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):211-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et al, Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 

20 Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374):173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally occurring enzymatic RNAs are known 

25 presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 

30 Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 

35 repeatedly bind and cleave new targets. 
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The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
5 oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 

10 substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf etal, Proc Natl Acad Sci USA. 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 

15 The enzymatic nucleic acid molecule may be formed in a hammerhead, 

hairpin, a hepatitis 8 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al Nucleic Acids Res. 1992 Sep ll;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel etal. (Eur. Pat. Appl. Publ. No. EP 0360257), 

20 Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal, Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis 5 virus motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;3 1(47): 11843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal, Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 

25 described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct l;88(19):8826-30; Collins and Olive, 
Biochemistry. 1993 Mar 23;32(ll):2795-9); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 

30 complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or surrounding that substrate binding site which impart an 
RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 

35 WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
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described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
5 ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 
92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 

10 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan etal. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 

15 administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 

20 Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 

25 of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 
94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 

30 vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 

35 Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells. Ribozymes 
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expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
5 retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 

10 traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey {Trends Biotechnol 1997 
Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 

15 that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

PNAs have 2-aminoethyl-glycine linkages replacing the normal 

20 phosphodiester backbone of DNA (Nielsen et al, Science 1991 Dec 6;254(5037):1497- 
500; Hanvey et al, Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 

25 stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 

30 Fmoc protocols are straightforward using manual or automated protocols (Norton et al , 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 

35 depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
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can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
5 providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 

10 can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 

15 Apr;3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun;l(3):175-83; Oram et al, 
Biotechniques. 1995 Sep;19(3):472-80; Footer et al, Biochemistry. 1996 Aug 
20;35(33):10673-9; Griffith et al, Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al, 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et al, 

20 Blood. 1996 Aug 15;88(4):141 1-7; Arinitage et al, Proc Natl Acad Sci USA. 1997 
Nov 11;94(23): 12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

25 Methods of characterizing the antisense binding properties of PNAs are 

discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al 
(Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements were made 

30 by Jensen et al. using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 

35 cleavage, biosensors, in situ hybridization, and the like. 
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Polynucleotide Identification, Characterization and Expression 

Polynucleotides compositions of the present invention may be identified, 
prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
5 Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (i.e., expression that 
is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 

10 using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. 
Acad. Sci. USA P3:10614-10619, 1996 and Heller et al, Proc. Natl. Acad. Sci. USA 
94:2 150-2 155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 

15 Many template dependent processes are available to amplify a target 

sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which 

20 are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 

25 and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 

30 known in the art. 

Any of a number of other template dependent processes, many of which 
are variations of the PCR ™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 

35 No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. 
PCT/US 87/008 80; Strand Displacement Amplification (SDA) and Repair Chain 
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Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 
Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
5 based amplification (NASBA) and 3 SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat. Appl. 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 

10 ("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PGR" (Ohara, 
1989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) 

15 using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

20 For hybridization techniques, a partial sequence may be labeled (e.g., by 

nick-translation or end-labeling with 32 P) using well-known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

25 Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 

30 complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 
then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 

35 can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR (see Triglia et al, Nucl. Acids Res. 
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76:8186, 1988), which uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PGR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
5 retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 

10 96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' 
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
al., PCR Methods Applic. 7:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids. 

15 Res. 72:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 

20 performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 

25 functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

30 As will be understood by those of skill in the art, it may be advantageous 

in some instances to produce polypeptide-encoding nucleotide sequences possessing 
non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a half- 

35 life which is longer than that of a transcript generated from the naturally occurring 
sequence. 
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Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
5 example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

10 In another embodiment of the invention, natural, modified, or 

recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 

15 engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 

20 (1980) Nucl Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 

25 achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto, CA). 

A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 

30 techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

35 In order to express a desired polypeptide, the nucleotide sequences 

encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
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expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
5 translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 
10 NY. 

A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 

15 insect cell systems infected with virus expression vectors (e.g., baculo virus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 

20 expression vector are those non-translated regions of the vector—enhancers, promoters, 
5' and 3' untranslated regions—which interact with host cellular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 

25 For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 

30 copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 
advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be 
selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 

35 which direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifunctional E. coli cloning 
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and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 
5 M. Schuster (1989) J. Biol. Chem. 254:5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
10 systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 

15 be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Enzymol. 153:516-544. 

In cases where plant expression vectors are used, the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 

20 alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
(1987) EMBOJ. (5:307-311. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
Results Probl. Cell Differ. 77:85-105). These constructs can be introduced into plant 

25 cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 

30 For example, in one such system, Autographa californica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 

35 will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
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cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Set 91 :3224-3227). 

In mammalian host cells, a number of viral-based expression systems are 
generally available. For example, in cases where an adenovirus is used as an expression 
5 vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 57:3655-3659). In addition, 

10 transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 

15 polypeptide, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 

20 reading frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162). 

25 In addition, a host cell strain may be chosen for its ability to modulate 

the expression of the inserted sequences or to process the expressed protein in the 
« desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 

30 used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, COS, HeLa, MDCK, HEK293, and WD 8, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may be chosen to 
ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 

35 expression is generally preferred. For example, cell lines which stably express a 
polynucleotide of interest may be transformed using expression vectors which may 
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contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 
5 resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 

10 kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase 
(Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or 
aprt.sup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which 

15 confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al (1981) J. Mol. Biol. 750:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 

20 place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 
55:8047-51). The use of visible markers has gained popularity with such markers as 
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 

25 (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-13 1). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 

30 absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 

35 polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
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RNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
5 polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 

10 preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. etal. (1983; J. Exp. Med. 755:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 

15 skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 

20 are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 

25 as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 

30 by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 

35 purification of soluble proteins. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
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purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
5 Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 

10 described in Porath, J. et al. (1992, Prot. Exp. Purif. 5:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
et al. (1993; DNA Cell Biol. 72:441-453). 

In addition to recombinant production methods, polypeptides of the 

15 invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 

20 synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 

25 immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 
or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," "immunogically bind," and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA assay) with the polypeptide, and does not react detectably with unrelated 

30 polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the . 

35 dissociation constant (Kd) of the interaction, wherein a smaller Kd represents a greater 
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affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 
5 on geometric parameters that equally influence the rate in both directions. Thus, both 
the "on rate constant" (K on ) and the "off rate constant" (K 0 ff) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of Koff /K<> n enables cancellation of all parameters not related to affinity, and is 
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual 

10 Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody refers to 
the part of the immunoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches 

15 within the V regions of the heavy and light chains are referred to as "hypervariable 
regions" which are interposed between more conserved flanking stretches known as 
"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
immunoglobulins. In an antibody molecule, the three hypervariable regions of a light 

20 chain and the three hypervariable regions of a heavy chain are disposed relative to each 
other in three dimensional space to form an antigen-binding surface. The antigen- 
binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
as "complementarity-determining regions," or "CDRs." 

25 Binding agents may be further capable of differentiating between patients 

with and without a cancer, such as lung cancer, using the representative assays provided 
herein. For example, antibodies or other binding agents that bind to a tumor protein 
will preferably generate a signal indicating the presence of a cancer in at least about 
20% of patients with the disease, more preferably at least about 30% of patients. 

30 Alternatively, or in addition, the antibody will generate a negative signal indicating the 
absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological samples (e.g., 
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 

35 for the presence of polypeptides that bind to the binding agent. Preferably, a statistically 
significant number of samples with and without the disease will be assayed. Each 
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binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 
5 For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 

10 general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals {e.g., mice, rats, rabbits, sheep 

15 or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 

20 incorporating one or more booster immunizations, and the animals are bled periodically. 
Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 

25 may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol. (5:511-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity {i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 

30 described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 

35 cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
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colonies of hybrids are observed. Single colonies are selected and their culture 
supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 
5 hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 

10 extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

A number of therapeutically useful molecules are known in the art which 
comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic enzyme papain preferentially 

15 cleaves IgG molecules to yield several fragments, two of which (the 'T(ab)" fragments) 
each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the "F(ab') 2 " fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 

20 IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 
derived using recombinant techniques known in the art. The Fv fragment includes a 
non-covalent Vh"Vl heterodimer including an antigen-binding site which retains much 
of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 

25 Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked Vh-Vl 
heterodimer which is expressed from a gene fusion including Vr- and VL-encoding 
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 

30 structures for converting the naturally aggregated~but chemically separated-light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, e.g., U.S. Pat Nos. 5,091,513 and 5,132,405, to Huston et al; 
and U.S. Pat. No. 4,946,778, to Ladner et al. 

35 Each of the above-described molecules includes a heavy chain and a 

light chain CDR set, respectively interposed between a heavy chain and a light chain FR 
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set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDR1," "CDR2," and 
5 "CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 
comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
comprising a single CDR, (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 

10 contact with bound antigen, wherein the most extensive antigen contact is with the 
heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a heavy or light chain V region. 

15 Some FR residues may contact bound antigen; however, FRs are primarily responsible 
for folding the V region into the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
features are very highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 

20 binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 
binding surface. It is generally recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 

25 the interaction of the antibody heavy and light chains. 

A number of "humanized" antibody molecules comprising an antigen- 
binding site derived from a non-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989) 

30 Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 
4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 

35 supported by recombinantly veneered rodent FRs (European Patent Publication No. 
519,596, published Dec. 23, 1992). These "humanized" molecules are designed to 



68 



WO 02/092001 



PCT7US02/14975 



minimize unwanted immunological response toward rodent antihuman antibody 
molecules which limits the duration and effectiveness of therapeutic applications of 
those moieties in human recipients. 

As used herein, the terms "veneered FRs" and "recombinantly veneered 
5 FRs" refer to the selective replacement of FR residues from, e.g., a. rodent heavy or light 
chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
the ligand binding characteristics of an antigen-binding site are determined primarily by 

10 the structure and relative disposition of the heavy and light chain CDR sets within the 
antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 

15 (e.g., solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly immunogenic, or substantially non-immunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 

20 human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 
Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 

25 human and murine antibody fragments. There are two general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
domains obtained from the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 

30 residues in the murine FR which differ from the human counterpart are replaced by the 
residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only carried out with moieties which are at least partially 
exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 

35 domains, such as proline, glycine and charged amino acids. 
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In this manner, the resultant "veneered" murine antigen-binding sites are 
thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) 
5 contacts between heavy and light chain domains, and the residues from conserved 
structural regions of the FRs which are believed to influence the "canonical" tertiary 
structures of the CDR loops. These design criteria are then used to prepare recombinant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to transfect 

10 mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

In another embodiment of the invention, monoclonal antibodies of the 
present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives 

15 thereof. Preferred radionuclides include 90 Y, 123 I, 125 I, 131 I, I86 Re, I88 Re, 211 At, and 
212 Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin, Shigella toxin, and pokeweed antiviral protein. 

20 A therapeutic agent may be coupled (e.g., covalently bonded) to a 

suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 

25 containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an 
antibody from an agent in order to avoid interference with binding capabilities. A linker 

30 group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional 

35 or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
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group. Coupling may be effected, for example, through amino groups, carboxyl groups, 
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
5 portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 

10 photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al). 

It may be desirable to couple more than one agent to an antibody. In one 

15 embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 

20 attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al). A carrier may 

25 also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 
compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 

30 formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. 
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T Cell Compositions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 
5 T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone 
marrow or peripheral blood of a patient, using a commercially available cell separation 
system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 

1 0 unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide encoding a 
polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, a tumor 

15 polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell 

20 specificity may be evaluated using any of a variety of standard techniques. For 
example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 
indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et al, Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the 

25 proliferation of T cells may be accomplished by a variety of known techniques. For 
example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 jag/ml, preferably 200 ng/ml - 25 ng/ml) for 3 - 7 

30 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 

35 cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4 + and/or CD8 + . Tumor polypeptide-specific T 
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cells may be expanded using standard techniques. Within preferred embodiments, the T 
cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

For therapeutic purposes, CD4 + or CD8 + T cells that proliferate in 
5 response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
10 cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

Pharmaceutical Compositions 

15 In additional embodiments, the present invention concerns formulation 

of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 
disclosed herein in pharmaceutically-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 

20 may be administered in combination with other agents as well, such as, e.g., other 
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 
additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 

25 agents as required in the particular instance. Such compositions may be purified from 
host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein. Likewise, such compositions may further comprise substituted or 
derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 

30 compositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carrier. In certain preferred embodiments, the 
pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of the invention for use in prophylactic and theraputic 

35 vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
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Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 
5 It will be apparent that any of the pharmaceutical compositions described 

herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 
pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of 
primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., 

10 sodium, potassium, lithium, ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immunogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 

15 systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 

20 promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerrin) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable mammalian 

25 host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 

30 to a subject. A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 

35 In addition, a number of illustrative adeno virus-based systems have also 

been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
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persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:5911-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
al. (1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
5 (1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 

10 International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 
539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 
Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 

15 Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 
from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 

20 constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 
then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous recombination serves to insert the vaccinia promoter plus the gene 

25 encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromo deoxyuridine and picking viral plaques resistant thereto. 

A vaccinia-based infection/transfection system can be conveniently used 
to provide for inducible, transient expression or coexpression of one or more 

30 polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide or polynucleotides of interest, driven by a T7 

35 promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
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polypeptide by the host translational machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of RNA and its translation 
products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 
5 Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 

can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 
protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 

10 of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 
above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

15 Any of a number of alphavirus vectors can also be used for delivery of 

polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 
on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

20 Moreover, molecular conjugate vectors, such as the adenovirus chimeric 

vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 
under the invention. 

Additional illustrative information on these and other known viral-based 

25 delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. 
USA 86:3 17-321, 1989; Flexner et al., Ann. NY. Acad. Sci. 559:86-103, 1989; Flexner 
et al., Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniqws 6:616-627, 1988; Rosenfeld et al., Science 

30 252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 97:215-219, 1994; 
Kass-Eisler et al, Proc. Natl. Acad. Sci. USA 90:11498-11502, 1993; Guzman et al., 
Circulation §5:2838-2848, 1993; and Guzman et al., Cir. Res. 73:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into the 
genome of a target cell. This integration may be in the specific location and orientation 

35 via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
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polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 
permit maintenance and replication independent of or in synchronization with the host 
cell cycle. The manner in which the expression construct is delivered to a cell and 
5 where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

In another embodiment of the invention, a polynucleotide is 
administered/delivered as "naked" DNA, for example as described in Ulmer et al., 
Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. 

10 The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

hi still another embodiment, a composition of the present invention can 
be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 

15 devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 

20 are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 

25 in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 

30 of this invention. An immunostimulant refers to essentially any substance that enhances 
or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant. Many 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 

35 as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 
Certain adjuvants are commercially available as, for example, Freund's Incomplete 
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Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, 
PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 
5 sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

Within certain embodiments of the invention, the adjuvant composition 

10 is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines (e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 

1 5 provided herein, a patient will support an immune response that includes Thl - and Tin- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
standard assays. For a review of the families of cytokines, see Mosmann and Coffman, 

20 Ann. Rev. Immunol. 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
de-O-acylated monophosphoryl lipid A, together with an aluminum salt. MPL® 
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 

25 Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 
for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory DNA sequences are also described, for example, by 

30 Sato et al., Science 273:352, 1996. Another preferred adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or 
Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 

35 combinations of at least two of the following group comprising QS21, QS7, Quil A, P- 
escin, or digitonin. 
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Alternatively the saponin formulations may be combined with vaccine 
vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 
particles composed of polysaccharides or chemically modified polysaccharides, 
5 liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to form particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 
suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
10 saponins may also be formulated with excipients such as Carbopol R to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 
lactose. 

In one preferred embodiment, the adjuvant system includes the 
combination of a monophosphoryl lipid A and a saponin derivative, such as the 

15 combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 

20 95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 

25 Additional illustrative adjuvants for use in the pharmaceutical 

compositions of the invention include Montanide ISA 720 (Seppic, France), SAP 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 

30 MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
in pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the 
disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general 

35 formula 

(I): HO(CH 2 CH 2 0) n -A-R, 
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wherein, n is 1-50, A is a bond or -C(O)-, R is Ci-so alkyl or Phenyl C1-50 alkyl. 

One embodiment of the present invention consists of a vaccine 
formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 
between 1 and 50, preferably 4-24, most preferably 9; the R component is Ci. 50 , 
5 preferably C4-C20 alkyl and most preferably C12 alkyl, and i is a bond. The 
concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 
from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 
ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, polyoxyethylene-4- 
10 lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck 
index (12 th edition: entry 7717). These adjuvant molecules are described in WO 
99/52549. 

The polyoxyethylene ether according to the general formula (I) above 
15 may, if desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 

According to another embodiment of this invention, an immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 

20 such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 
to increase the capacity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs 

25 may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 

30 APCs (Banchereau and Steinman, Nature 502:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 
antitumor immunity (see Timmerman and Levy, Ann. Rev. Med. 50:507-529, 1999). In 
general, dendritic cells may be identified based on their typical shape (stellate in situ, 
with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 

35 process and present antigens with high efficiency and their ability to activate nai've T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
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surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine (see Zitvogel et al., Nature Med. 4:594-600, 
5 1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 

10 cytokines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of monocytes 
harvested from peripheral blood. Alternatively, CD34 positive cells harvested from 
peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 

1 5 maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 
cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 

20 characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and mannose receptor. The mature 
phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules (e.g., CD54 and CD11) and costimulatory molecules 

25 (e.g., CD40, CD80, CD86 and 4-1BB). 

APCs may generally be transfected with a polynucleotide of the 
invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 

30 may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 
administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
methods known in the art, such as those described in WO 97/24447, or the gene gun 

35 approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
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progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
5 carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 

While any suitable carrier known, to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 
will typically vary depending on the mode of administration. Compositions of the 

10 present invention may be formulated for any appropriate manner of administration, 
including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 

15 formulation preferably provides a relatively constant level of active component release. 
In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is well within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 

20 starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., 
a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No. 
5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The 

25 amount of active compound contained within a sustained release formulation depends 
upon the site of implantation, the rate and expected duration of release and the nature of 
the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 

30 invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 

35 comprising particulate-protein complexes, such as those described in U.S. Patent No. 
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5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host. 

The pharmaceutical compositions of the invention will often further 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
5 saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 

1 0 compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in 
unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 

15 or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
particular compositions described herein in a variety of treatment regimens, including 

20 e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 
general purposes of illustration. 

In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 

25 may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 

30 suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et ah, Nature 
1997 Mar 27;386(6623):410-4; Hwang et ah, Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 

35 cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, 
such as corn starch, potato starch, alginic acid and the like; a lubricant, such as 
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magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
materials of the above type, a liquid carrier. Various other materials may be present as 
5 coatings or to otherwise modify the physical form of the dosage unit. For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 
may be incorporated into sustained-release preparation and formulations. 

10 Typically, these formulations will contain at least about 0.1% of the 

active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 
active compound(s) in each therapeutically useful composition may be prepared is such 

15 a way that a suitable dosage will be obtained in any given unit dose of the compound. 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatment regimens may be desirable. 

20 For oral administration the compositions of the present invention may 

alternatively be incorporated with one or more excipients in the form of a mouthwash, 
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 

25 dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the pharmaceutical 

30 compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
as free base or pharmacologically acceptable salts may be prepared in water suitably 

35 mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
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Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
5 preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent that 
easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medium 

10 containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 
oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. The prevention of the action of microorganisms can be 

15 facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 

20 and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 
the solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 

25 administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 

30 1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 
course preferably meet sterility, pyrogeniciry, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 

35 herein may be formulated in a neutral or salt form. Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with the free 
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amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
5 calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 

10 vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 

15 active ingredients can also be incorporated into the compositions. The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 

20 Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resins (Takenaga et ah, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 

25 the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 
a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
present invention into suitable host cells/organisms. In particular, the compositions of 

30 the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, either covalently or non-covalently, 
to the surface of such carrier vehicles. 

The formation and use of liposome and liposome-like preparations as 

35 potential drug carriers is generally known to those of skill in the art (see for example, 
Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
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Mar;56(3):691-5; Chandran et al, Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 
5 Liposomes have been used successfully with a number of cell types that 

are normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, J Biol Chem. 1990 Sep 
25;265(27): 16337-42; Muller et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 

10 systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

15 In certain embodiments, liposomes are formed from phospholipids that 

are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of the 

20 present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, Quintanar-Guerrero et al, Drug Dev Ind Pharm. 
1998 Dec;24(12):l 113-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 um) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 

25 example, by Couvreur etal, Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 
Muhlen et al, Eur J Pharm Biopharm. 1998 Mar;45 (2) -.149-55; Zambaux et al. J 
Controlled Release. 1998 Jan 2;50(l-3):31-40; and U. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

In further aspects of the present invention, the pharmaceutical 

30 compositions described herein may be used for the treatment of cancer, particularly for 
the immunotherapy of lung cancer. Within such methods, the pharmaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 
Accordingly, the above pharmaceutical compositions may be used to prevent the 

35 development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 
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compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 
pharmaceutical compositions may be by any suitable method, including administration 
5 by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 

Within certain embodiments, immunotherapy may be active 
immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
10 response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

Within other embodiments, immunotherapy may be passive 
immunotherapy, in which treatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) that can directly or 

15 indirectly mediate antitumor effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 
lymphocytes (such as CD8 + cytotoxic T lymphocytes and CD4 + T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 

20 macrophages) expressing a polypeptide provided herein. T cell receptors and antibody 
receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunotherapy. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 

25 immunotherapy. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 

30 conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 
immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 

35 monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
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in the art. For example, antigen-presenting cells can be transfected with a 
polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
5 have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:111, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 

10 vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 

15 readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.g., intracutaneous, 
intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 

20 may be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 

25 dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e.g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 

30 comprising one or more polypeptides, the amount of each polypeptide present in a dose 
ranges from about 25 ug to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 

35 benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
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survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
5 obtained from a patient before and after treatment. 

Cancer Detection and Diagnostic Compositions, Methods and Kits 

In general, a cancer may be detected in a patient based on the presence of 
one or more lung tumor proteins and/or polynucleotides encoding such proteins in a 
biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 

10 obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as lung cancer. In addition, such 
proteins may be useful for the detection of other cancers. The binding agents provided 
herein generally permit detection of the level of antigen that binds to the agent in the 
biological sample. Polynucleotide primers and probes may be used to detect the level of 

15 mRNA encoding a tumor protein, which is also indicative of the presence or absence of 
a cancer. In general, a lung tumor sequence should be present at a level that is at least 
three fold higher in tumor tissue than in normal tissue 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 

20 Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absence of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 

25 In a preferred embodiment, the assay involves the use of binding agent 

immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 
reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 

30 binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, 
protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. The extent to 

35 which components of the sample inhibit the binding of the labeled polypeptide to the 
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binding agent is indicative of the reactivity of the sample with the immobilized binding 
agent. Suitable polypeptides for use within such assays include full length lung tumor 
proteins and polypeptide portions thereof to which the binding agent binds, as described 
above. 

5 The solid support may be any material known to those of ordinary skill 

in the art to which the tumor protein may be attached. For example, the solid support 
may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 

10 magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 

15 and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 
Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of time. The contact time 

20 varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 
polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
10 jag, and preferably about 100 ng to about 1 jag, is sufficient to immobilize an 
adequate amount of binding agent. 

25 Covalent attachment of binding agent to a solid support may generally be 

achieved by first reacting the support with a bifunctional reagent that will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating using benzoquinone or by condensation of an 

30 aldehyde group on the support with an amine and an active hydrogen on the binding 
partner {see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at 
A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
35 on a solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 
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Unbound sample is then removed from the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. The amount of 
detection reagent that remains bound to the solid support is then determined using a 
5 method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 

10 immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (i.e., incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained from an individual with lung cancer. Preferably, 

15 the contact time is sufficient to achieve a level of binding that is at least about 95% of 
that achieved at equilibrium between bound and unbound polypeptide. Those of 
ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 

20 sufficient. 

Unbound sample may then be removed by washing the solid support 
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include those groups recited above. 

25 The detection reagent is then incubated with the immobilized antibody- 

polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 

30 for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 

35 reporter groups may generally be detected by the addition of substrate (generally for a 
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specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of a cancer, such as lung cancer, 
the signal detected from the reporter group that remains bound to the solid support is 
5 generally compared to a signal that corresponds to a predetermined cut-off value. In 
one preferred embodiment, the cut-off value for the detection of a cancer is the average 
mean signal obtained when the immobilized antibody is incubated with samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-off value is considered positive for the 

10 cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curve, according to the method of Sackett et al., Clinical 
Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) 

15 that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand corner (i.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 

20 minimize the false positive rate, or to the right, to minimize the false negative rate. In 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the binding agent is immobilized on a membrane, such as 

25 nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 

30 format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 

35 generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
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on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
5 fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about lug, and more preferably from about 50 ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 

Of course, numerous other assay protocols exist that are suitable for use 

10 with the tumor proteins or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such tumor protein specific antibodies may correlate with the 

1 5 presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of 
T cells that specifically react with a tumor protein in a biological sample. Within 
certain methods, a biological sample comprising CD4 + and/or CD8 + T cells isolated 
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 

20 polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 
example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 

25 cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide 
(e.g., 5-25 ug/ml). It may be desirable to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4 + T cells, activation is 
preferably detected by evaluating proliferation of the T cells. For CD8 + T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 

30 that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected based on 
the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 

35 (PCR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (i.e., 
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hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 
then separated and detected using techniques well known in the art, such as gel 
electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
5 the presence of polynucleotide encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 1 0 

10 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 
polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 

15 ha a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PCR based assays and 
hybridization assays are well known in the art {see, for example, Mullis et al, Cold 
Spring Harbor Symp. Quant. Biol, 51:263, 1987; Erlich ed., PCR Technology, Stockton 

20 Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, which 

25 may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and from an 
individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 
or greater increase in expression in several dilutions of the test patient sample as 

30 compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

In another embodiment, the compositions described herein may be used 
as markers for the progression of cancer. In this embodiment, assays as described above 
for the diagnosis of a cancer may be performed over time, and the change in the level of 
35 reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
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performed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 
5 Certain in vivo diagnostic assays may be performed directly on a tumor. 

One such assay involves contacting tumor cells with a binding agent. The bound 
binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

10 As noted above, to improve sensitivity, multiple tumor protein markers 

may be assayed within a given sample. It will be apparent that binding agents specific 
for different proteins provided herein may be combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 

15 optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 
herein may be combined with assays for other known tumor antigens. 

The present invention further provides kits for use within any of the 
above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 

20 containers and/or equipment. For example, one container within a kit may contain a 
monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 
Such antibodies or fragments may be provided attached to a support material, as 
described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 

25 a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 

Alternatively, a kit may be designed to detect the level of mRNA 
encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 

30 polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following Examples are offered by way of illustration and not by 

3 5 way of limitation. 
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Example 1 

Preparation Of Lung Tumor-Specific Cdna Sequences Using Differential 
Display Rt-Pcr 

5 This example illustrates the preparation of cDNA molecules encoding 

lung tumor-specific polypeptides using a differential display screen. 

Tissue samples were prepared from lung tumor and normal tissue of a 
patient with lung cancer that was confirmed by pathology after removal of samples from 
the patient. Normal RNA and tumor RNA was extracted from the samples and mRNA 

10 was isolated and converted into cDNA using a (dT)i2AG (SEQ ID NO: 47) anchored 3' 
primer. Differential display PCR was then executed using a randomly chosen primer 
(SEQ ID NO: 48). Amplification conditions were standard buffer containing 1.5 mM 
MgCl 2 , 20 pmol of primer, 500 pmol dNTP and 1 unit of Taq DNA polymerase (Perkin- 
Elmer, Branchburg, NJ). Forty cycles of amplification were performed using 94 °C 

15 denaturation for 30 seconds, 42 °C annealing for 1 minute and 72 °C extension for 30 
seconds. Bands that were repeatedly observed to be specific to the RNA fingerprint 
pattern of the tumor were cut out of a silver stained gel, subcloned into the pGEM-T 
vector (Promega, Madison, WI) and sequenced. The isolated 3' sequences are provided 
in SEQ ID NO: 1-16. 

20 Comparison of these sequences to those in the public databases using the 

BLASTN program, revealed no significant homologies to the sequences provided in 
SEQ ID NOs:l-l 1. To the best of the inventors' knowledge, none of the isolated DNA 
sequences have previously been shown to be expressed at a greater level in human lung 
tumor tissue than in normal lung tissue. 

25 

Example 2 

Use of Patient Sera to Identify dna Sequences Encoding 
Lung tumor antigens 
30 This example illustrates the isolation of cDNA sequences encoding lung 

tumor antigens by expression screening of lung tumor samples with autologous patient 
sera. 

A human lung tumor directional cDNA expression library was 
constructed employing the Lambda ZAP Express expression system (Stratagene, La 
35 Jolla, CA). Total RNA for the library was taken from a late SCID mouse passaged 
human squamous epithelial lung carcinoma and poly A+ RNA was isolated using the 
Message Maker kit (Gibco BRL, Gaithersburg, MD). The resulting library was 
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screened using E. co/z'-absorbed autologous patient serum, as described in Sambrook et 
al., (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold 
Spring Harbor, NY, 1989), with the secondary antibody being goat anti-human IgG-A- 
M (H + L) conjugated with alkaline phosphatase, developed with NBT/BCIP (Gibco 
5 BRL) <;> Positive plaques expressing immunoreactive antigens were purified. Phagemid 
from the plaques was rescued and the nucleotide sequences of the clones was 
determined. 

Fifteen clones were isolated, referred to hereinafter as LT86-1 - LT86- 
15. The isolated cDNA sequences for LT86-1 - LT86-8 and LT86-10 - LT86-15 are 

10 provided in SEQ ID NO: 17-24 and 26-31, respectively, with the corresponding 
predicted amino acid sequences being provided in SEQ ID NOs:32-39 and 41-46, 
respectively. The determined cDNA sequence for LT86-9 is provided in SEQ ID 
NO:25, with the corresponding predicted amino acid sequences from the 3' and 5' ends 
being provided in SEQ ID NOs:40 and 65, respectively. These sequences were 

15 compared to those in the gene bank as described above. Clones LT86-3, LT86-6 - 
LT86-9, LT86-11 - LT86-13 and LT86-15 (SEQ ID NO: 19, 22-25, 27-29 and 31, 
respectively) were found to show some homology to previously identified expressed 
sequence tags (ESTs), with clones LT86-6, LT86-8, LT86-11, LT86-12 and LT86-15 
appearing to be similar or identical to each other. Clone LT86-3 was found to show 

20 some homology with a human transcription repressor. Clones LT86-6, 8, 9, 11, 12 and 
15 were found to show some homology to a yeast RNA Pol II transcription regulation 
mediator. Clone LT86-13 was found to show some homology with a C. elegans leucine 
aminopeptidase. Clone LT86-9 appears to contain two inserts, with the 5' sequence 
showing homology to the previously identified antisense sequence of interferon alpha- 

25 induced P27, and the 3' sequence being similar to LT86-6. Clone LT86-14 (SEQ ID 
NO: 30) was found to show some homology to the trithorax gene and has an "RGD" cell 
attachment sequence and a beta-Lactamase A site which functions in hydrolysis of 
penicillin. Clones LT86-1, LT86-2, LT86-4, LT86-5 and LT86-10 (SEQ ID NOs:17, 
18, 20, 21 and 26, respectively) were found to show homology to previously identified 

30 genes. A subsequently determined extended cDNA sequence for LT86-4 is provided in 
SEQ ID NO:66, with the corresponding predicted amino acid sequence being provided 
in SEQ ID NO: 67. 

Subsequent studies led to the isolation of five additional clones, referred 
to as LT86-20, LT86-21, LT86-22, LT86-26 and LT86-27. The determined 5' cDNA 

35 sequences for LT86-20, LT86-22, LT86-26 and LT86-27 are provided in SEQ ID NO: 
68 and 70-72, respectively, with the detennined 3' cDNA sequences for LT86-21 being 
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provided in SEQ ID NO: 69. The corresponding predicted amino acid sequences for 
LT86-20, LT86-21, LT86-22, LT86-26 and LT86-27 are provided in SEQ ID NO: 73- 
77, respectively. LT86-22 and LT86-27 were found to be highly similar to each other. 
Comparison of these sequences to those in the gene bank as described above, revealed 
5 no significant homologies to LT86-22 and LT86-27. LT86-20, LT86-21 and LT86-26 
were found to show homology to previously identified genes. 

In further studies, a cDNA expression library was prepared using mRNA 
from a lung small cell carcinoma cell line in the lambda ZAP Express expression vector 
(Stratagene), and screened as described above, with a pool of two lung small cell 

10 carcinoma patient sera. The sera pool was adsorbed with E. coli lysate and human 
PBMC lysate was added to the serum to block antibody to proteins found in normal 
tissue. Seventy-three clones were isolated. The determined cDNA sequences of these 
clones are provided in SEQ ID NO: 290-362. The sequences of SEQ ID NO: 289-292, 
294, 296-297, 300, 302, 303, 305, 307-315, 317-320, 322-325, 327-332, 334, 335, 338- 

15 341, 343-352, 354-358, 360 and 362 were found to show some homology to previously 
isolated genes. The sequences of SEQ ID NO: 293, 295, 298, 299, 301, 304, 306, 316, 
321, 326, 333, 336, 337, 342, 353, 359 and 361 were found to show some homology to 
previously identified ESTs. 

20 Example 3 

Use of Mouse Antisera to Identify dna Sequences Encoding 
Lung Tumor Antigens 
This example illustrates the isolation of cDNA sequences encoding lung 
tumor antigens by screening of lung tumor cDNA libraries with mouse anti-tumor sera. 
25 A directional cDNA lung tumor expression library was prepared as 

described above in Example 2. Sera was obtained from SCID mice containing late 
passaged human squamous cell and adenocarcinoma tumors. These sera were pooled 
and injected into normal mice to produce anti-lung tumor serum. Approximately 
200,000 PFUs were screened from the unamplified library using this antiserum. Using 
30 a goat anti-mouse IgG-A-M (H+L) alkaline phosphatase second antibody developed 
with NBT/BCIP (BRL Labs.), approximately 40 positive plaques were identified. 
Phage was purified and phagemid excised for 9 clones with inserts in a pBK-CMV 
vector for expression in prokaryotic or eukaryotic cells. 

The determined cDNA sequences for 7 of the isolated clones (hereinafter 
35 referred to as L86S-3, L86S-12, L86S-16, L86S-25, L86S-36, L86S-40 and L86S-46) 
are provided in SEQ ID NO: 49-55, with the corresponding predicted amino acid 
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sequences being provided in SEQ ID NO: 56-62, respectively. The 5' cDNA sequences 
for the remaining 2 clones (hereinafter referred to as L86S-30 and L86S-41) are 
provided in SEQ ID NO: 63 and 64. L86S-36 and L86S-46 were subsequently 
determined to represent the same gene. Comparison of these sequences with those in 
5 the public database as described above, revealed no significant homologies to clones 
L86S-30, L86S-36 and L86S-46 (SEQ ID NO: 63, 53 and 55, respectively). L86S-16 
(SEQ ID NO: 51) was found to show some homology to an EST previously identified in 
fetal lung and germ cell tumor. The remaining clones were found to show at least some 
degree of homology to previously identified human genes. Subsequently determined 

10 extended cDNA sequences for L86S-12, L86S-36 and L86S-46 are provided in SEQ ID 
NO: 78-80, respectively, with the corresponding predicted amino acid sequences being 
provided in SEQ ID NO: 81-83. 

Subsequent studies led to the determination of 5' cDNA sequences for an 
additional nine clones, referred to as L86S-6, L86S-11, L86S-14, L86S-29, L86S-34, 

15 L86S-39, L86S-47, L86S-49 and L86S-51 (SEQ ID NO: 84-92, respectively). The 
corresponding predicted amino acid sequences are provided in SEQ ID NO: 93-101, 
respectively. L86S-30, L86S-39 and L86S-47 were found to be similar to each other. 
Comparison of these sequences with those in the gene bank as described above, 
revealed no significant homologies to L86S-14. L86S-29 was found to show some 

20 homology to a previously identified EST. L86S-6, L86S-1 1, L86S-34, L86S-39, L86S- 
47, L86S-49 and L86S-51 were found to show some homology to previously identified 
genes. 

In further studies, a directional cDNA library was constructed using a 
Stratagene kit with a Lambda Zap Express vector. Total RNA for the library was 

25 isolated from two primary squamous lung tumors and poly A+ RNA was isolated using 
an oligo dT column. Antiserum was developed in normal mice using a pool of sera 
from three SCID mice implanted with human squamous lung carcinomas. 
Approximately 700,000 PFUs were screened from the unamplified library with E. coli 
absorbed mouse anti-SCID tumor serum. Positive plaques were identified as described 

30 above. Phage was purified and phagemid excised for 180 clones with inserts in a pBK- 
CMV vector for expression in prokaryotic or eukaryotic cells. 

The determined cDNA sequences for 23 of the isolated clones are 
provided in SEQ ID NO: 126-148. Comparison of these sequences with those in the 
public database as described above revealed no significant homologies to the sequences 

35 of SEQ ID NO: 139 and 143-148. The sequences of SEQ ID NO: 126-138 and 140-142 
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were found to show homology to previously identified human polynucleotide 
sequences. 

Example 4 

5 Use of Mouse Antisera to Screen Lung Tumor Libraries 

Prepared from Scid Mice 
This example illustrates the isolation of cDNA sequences encoding lung 
tumor antigens by screening of lung tumor cDNA libraries prepared from SCID mice 
with mouse anti-tumor sera. 

10 A directional cDNA lung tumor expression library was prepared using a 

Stratagene kit with a Lambda Zap Express vector. Total RNA for the library was taken 
from a late passaged lung adenocarcinoma grown in SCID mice. Poly A+ RNA was 
isolated using a Message Maker Kit (Gibco BRL). Sera was obtained from two SCID 
mice implanted with lung adenocarcinomas. These sera were pooled and injected into 

15 normal mice to produce anti-lung tumor serum. Approximately 700,000 PFUs were 
screened from the unamplified library with E. co/z'-absorbed mouse anti-SCID tumor 
serum. Positive plaques were identified with a goat anti-mouse IgG-A-M (H+L) 
alkaline phosphatase second antibody developed with NBT/BCIP (Gibco BRL). Phage 
was purified and phagemid excised. for 100 clones with insert in a pBK-CMV vector for 

20 expression in prokaryotic or eukaryotic cells. 

The determined 5' cDNA sequences for 33 of the isolated clones are 
provided in SEQ ID NO: 149-181. The corresponding predicted amino acid sequences 
for SEQ ID NO: 149, 150, 152-154, 156-158 and 160-181 are provided in SEQ ID NO: 
182, 183, 186, 188-193 and 194-215, respectively. The clone of SEQ ID NO: 151 

25 (referred to as SAL-25) was found to contain two open reading frames (ORFs). The 
predicted amino acid sequences encoded by these ORFs are provided in SEQ ID NO: 
184 and 185. The clone of SEQ ID NO: 153 (referred to as SAL-50) was found to 
contain two open reading frames encoding the predicted amino acid sequences of SEQ 
ID NO: 187 and 216. Similarly, the clone of SEQ ID NO: 155 (referred to as SAL-66) 

30 was found to contain two open reading frames encoding the predicted amino acid 
sequences of SEQ ID NO: 189 and 190. Comparison of the isolated sequences with 
those in the public database revealed no significant homologies to the sequences of SEQ 
ID NO: 151, 153 and 154. The sequences of SEQ ID NO: 149, 152, 156, 157 and 158 
were found to show some homology to previously isolated expressed sequence tags 

35 (ESTs). The sequences of SEQ ID NO: 150, 155 and 159-181 were found to show 
homology to sequences previously identified in humans. 
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Using the procedures described above, two directional cDNA libraries 
(referred to as LT46-90 and LT86-21) were prepared from two late passaged lung 
squamous carcinomas grown in SOD mice and screened with sera obtained from SCID 
mice implanted with human squamous lung carcinomas. The determined cDNA 
5 sequences for the isolated clones are provided in SEQ ID NO: 217-237 and 286-289. 
SEQ ID NO: 286 was found to be a longer sequence of LT4690-71 (SEQ ID NO: 237). 
Comparison of these sequences with those in the public databases revealed no known 
homologies to the sequences of SEQ ID NO: 219, 220, 225, 226, 287 and 288. The 
sequences of SEQ ID NO: 218, 221, 222 and 224 were found to show some homology 

10 to previously identified sequences of unknown function. The sequence of SEQ ID NO: 
236 was found to show homology to a known mouse mRNA sequence. The sequences 
of SEQ ID NO: 217, 223, 227-237, 286 and 289 showed some homology to known 
human DNA and/or RNA sequences. 

In further studies using the techniques described above, one of the cDNA 

15 libraries described above (LT86-21) was screened with E. co/z'-absorbed mouse anti- 
SCID tumor serum. This serum was obtained from normal mice immunized with a pool 
of 3 sera taken from SCID mice implanted with human squamous lung carcinomas. The 
determined cDNA sequences for the isolated clones are provided in SEQ ID NO: 238- 
285. Comparison of these sequences with those in the public databases revealed no 

20 significant homologies to the sequences of SEQ ID NO: 253, 260, 277 and 285. The 
sequences of SEQ ID NO: 249, 250, 256, 266, 276 and 282 were found to show some 
homology to previously isolated expressed sequence tags (ESTs). The sequences of 
SEQ ID NO: 238-248, 251, 252, 254, 255, 257-259, 261-263, 265, 267-275, 278-281, 
283 and 284 were found to show some homology to previously identified DNA or RNA 

25 sequences. 

The expression levels of certain of the isolated antigens in lung tumor 
tissues compared to expression levels in normal tissues was determined by microarray 
technology. The results of these studies are shown below in Table 2, together with the 
databank analyses for these sequences. 

30 
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Table 2 



Clone 


SEQ 
ID 
NO: 


Description 


LT+ 
F/N 


SCC+M/ 
N 


Squa/N 


Adeno/ 
N 


2LT-3 


238 


Unknown 
(KIAA0712) 


2.2 


3.8 


3.3 




2LT-6 


239 


Lactate DH B 


2.3 


3.8 


4.1 




2LT-22 


240 


Fumarate hydratase 


_ 


3.0 






2LT-26 


242 


CG1-39 




_ 


12.8 




2LT-31 


243 


ADH7 






8.4 


2.2 


2LT-36 


244 


ADH7 




2.4 


2.0 


- 


2LT-42 


245 


HMG-CoA synthase 


2.2 


2.6 


2.2 




2LT-54 


247 


(Mus) ninein 


. 


2.1 


. 




2LT-55 


248 


Ubiquitin 


2.2 




2.5 


2.0 


2LT-57 


249 


Novel 


2.1 


2.9 


2.4 




2LT-58 


250 


Novel 


2.3 


4.0 


2.9 


. 


2LT-59 


251 


Unknown KIAA0784 


2.4 


3.0 


2.3 


2.0 


2LT 6 

2 


252 


Nuc Pore Cmplx-ass 
pro TPR 








2.1 


2LT-70 


256 


Unknown KIAA0871 




2.5 


2.2 


2.1 


2LT-73 


257 


Mus polyadenylate- 
binding 


- 


2.0 






2LT-76 


259 


Trans-Golgi p230 


2.1 


- 


2.6 




2LT-85 


263 


Ribosomal protein 
(LS29) 








2.1 


2LT-89 


265 


Unknown 
PAC212G6 




2.0 






2LT-98 


268 


Melanoma diff assoc 
pro 9 








2.2 


2LT- 
100 


269 


Mus Collagen alpha 
VI 








2.1 


2LT- 
105 


271 


NY-CO-7 antigen 




3.2 






2LT- 
108 


273 


Unknown 
RG363M04 




3.1 






2LT- 
124 


279 


Galectin-9 (secreted) 


2.3 


2.7 


2.0 




2LT- 
126 


280 


LI element L1.33 
p40 


2.5 




3.1 




2LT- 
128 


282 


Novel (kappa B-ras 
2) 


2.3+ 




20.4 


2.5 


2LT- 
133 


284 


Alpha II spectrin 




2.3 







LT+F/N = Lung Tumor plus Fetal tissue over Normal tissues 
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SC+M/N = Lung Small Cell carcinoma plus Metastatic over Normal tissues 
Squa/N = Squamous lung tumor over Normal tissues 
Aden/N = Adenocarcinoma over Normal tissues 

Full-length sequencing studies on antigen 2LT-128 (SEQ ID NO: 282) 
5 resulted in the isolation of the full-length cDNA sequence provided in SEQ ID NO: 392. 
This amino acid sequence encoded by this full-length cDNA sequence is provided in 
SEQ ID NO: 393. This antigen shows 20-fold over-expression in squamous cell 
carcinoma and 2.5-fold over-expression in lung adenocarcinoma. This gene has been 
described as a potential ras oncogene (Fenwick et al. Science, 257:869-873, 2000). 
10 Extended sequence information was obtained for clones 2LT-3 (SEQ ID 

NO:238), 2LT-26 (SEQ ID NO:242), 2LT-57 (SEQ ID NO: 249), 2LT-58 (SEQ ID 
NO:250), 2LT-98 (SEQ ID NO:268) and 2LT-124 (SEQ ID NO:279). The extended 
cDNA sequences for these clones are set forth in SEQ ID NOs:428-433, respectively, 
encoding the polypeptide sequences set forth in SEQ ID NOs: 434-439, respectively. 

15 

Example 5 

Determination Of Tissue Specificity Of Lung Tumor Polypeptides 

Using gene specific primers, mKNA expression levels for representative 
lung tumor polypeptides were examined in a variety of normal and tumor tissues using 
20 RT-PCR. 

Briefly, total RNA was extracted from a variety of normal and tumor 
tissues using Trizol reagent. First strand synthesis was carried out using 2 ug of total 
RNA with Superscript II reverse transcriptase (BRL Life Technologies) at 42 °C for one 
hour. The cDNA was then amplified by PCR with gene-specific primers. To ensure the 

25 semi-quantitative nature of the RT-PCR, p-actin was used as an internal control for each 
of the tissues examined. 1 ul of 1 :30 dilution of cDNA was employed to enable the 
linear range amplification of the (3-actin template and was sensitive enough to reflect the 
differences in the initial copy numbers. Using these conditions, the P-actin levels were 
determined for each reverse transcription reaction from each tissue. DNA 

30 contamination was minimized by DNase treatment and by assuring a negative PCR 
result when using first strand cDNA that was prepared without adding reverse 
transcriptase. 

mRNA Expression levels were examined in five different types of tumor 
tissue (lung squamous tumor from 3 patients, lung adenocarcinoma, prostate tumor, 
35 colon tumor and lung tumor), and different normal tissues, including lung from four 
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patients, prostate, brain, kidney, liver, ovary, skeletal muscle, skin, small intestine, 
myocardium, retina and testes. L86S-46 was found to be expressed at high levels in 
lung squamous tumor, colon tumor and prostate tumor, and was undetectable in the 
other tissues examined. L86S-5 was found to be expressed in the lung tumor samples 
5 and in 2 out of 4 normal lung samples, but not in the other normal or tumor tissues 
tested. L86S-16 was found to be expressed in all tissues except normal liver and normal 
stomach. Using real-time PCR, L86S-46 was found to be over-expressed in lung 
squamous tissue and normal tonsil, with expression being low or undetectable in all 
other tissues examined. 

10 

Example 6 

Isolation of dna Sequences Encoding Lung Tumor Antigens 

DNA sequences encoding antigens potentially involved in squamous cell 
lung tumor formation were isolated as follows. 

15 A lung tumor directional cDNA expression library was constructed 

employing the Lambda ZAP Express expression system (Stratagene, La Jolla, CA). 
Total RNA for the library was taken from a pool of two human squamous epithelial 
lung carcinomas and poly A+ RNA was isolated using oligo-dT cellulose (Gibco BRL, 
Gaithersburg, MD). Phagemid were rescued at random and the cDNA sequences of 

20 isolated clones were determined. 

The determined cDNA sequence for the clone SLT-T1 is provided in 
SEQ ID NO: 102, with the determined 5' cDNA sequences for the clones SLT-T2, SLT- 
T3, SLT-T5, SLT-T7, SLT-T9, SLT-T10, SLT-T11 and SLT-T12 being provided in 
SEQ ID NO: 103-1 10, respectively. The corresponding predicted amino acid sequence 

25 for SLT-T1, SLT-T2, SLT-T3, SLT-T10 and SLT-T12 are provided in SEQ ID NO: 
111-115, respectively. Comparison of the sequences for SLT-T2, SLT-T3, SLT-T5, 
SLT-T7, SLT-T9 and SLT-T11 with those in the public databases as described above, 
revealed no significant homologies. The sequences for SLT-T10 and SLT-T12 were 
found to show some homology to sequences previously identified in humans. 

30 The sequence of SLT-T1 was determined to show some homology to a 

PAC clone of unknown protein function. The cDNA sequence of SLT-T1 (SEQ ID 
NO: 102) was found to contain a mutator (MUTT) domain. Such domains are known to 
function in removal of damaged guanine from DNA that can cause A to G transversions 
(see, for example, el-Deiry, W.S., 1997 Curr. Opin. Oncol. 9:79-87; Okamoto, K. et al. 

35 1996 Int. J. Cancer 65:437-41; Wu, C. et al. 1995 Biochem. Biophys. Res. Commun. 
214:1239-45; Porter, D.W. et al. 1996 Chem. Res. Toxicol. 9:1375-81). SLT-T1 may 
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thus be of use in the treatment, by gene therapy, of lung cancers caused by, or associated 
with, a disruption in DNA repair. 

In further studies, DNA sequences encoding antigens potentially 
involved in adenocarcinoma lung tumor formation were isolated as follows. A human 
5 lung tumor directional cDNA expression library was constructed employing the Lambda 
ZAP Express expression system (Stratagene, La Jolla, CA). Total RNA for the library 
was taken from a late SCID mouse passaged human adenocarcinoma and poly A+ RNA 
was isolated using the Message Maker kit (Gibco BRL, Gaithersburg, MD). Phagemid 
were rescued at random and the cDNA sequences of isolated clones were determined. 

10 The determined 5' cDNA sequences for five isolated clones (referred to 

as SALT-T3, SALT-T4, SALT-T7, SALT-T8, and SALT-T9) are provided in SEQ ID 
NO: 1 16-120, with the corresponding predicted amino acid sequences being provided in 
SEQ ID NO: 121-125. SALT-T3 was found to show 98% identity to the previously 
identified human transducin-like enhancer protein TLE2. SALT-T4 appears to be the 

15 human homologue of the mouse H beta 58 gene. SALT-T7 was found to have 97% 
identity to human 3-mercaptopyruvate sulfurtransferase and SALT-T8 was found to 
show homology to human interferon-inducible protein 1-8U. SALT-T9 shows 
approximately 90% identity to human mucin MUC 5B. 

cDNA sequences encoding antigens potentially involved in small cell 

20 lung carcinoma development were isolated as follows. cDNA expression libraries were 
constructed with mRNA from the small cell lung carcinoma cell lines NCIH69, 
NCIH128 and DMS79 (all available from the American Type Culture Collection, 
Manassas, VA) employing the Lambda ZAP Express expression system (Stratagene, La 
Jolla, CA). Phagemid were rescued at random and the cDNA sequences of 27 isolated 

25 clones were determined. Comparison of the determined cDNA sequences revealed no 
significant homologies to the sequences of SEQ ID NO: 372 and 373. The sequences of 
SEQ ID NO: 364, 369, 377, 379 and 386 showed some homology to previously isolated 
ESTs. The sequences of the remaining 20 clones showed some homology to previously 
identified genes. The cDNA sequences of these clones are provided in SEQ ID NO: 

30 363, 365-368, 370, 371, 374-376, 378, 380-385 and 387-389, wherein SEQ ID NO: 
363, 366-368, 370, 375, 376, 378, 380-382, 384 and 385 are full-length sequences. 

Comparison of the cDNA sequence of SEQ ID NO: 372 indicated that 
this clone (referred to as 128T1) is a novel member of a family of putative seven pass 
transmembrane proteins. Specifically, using the computer algorithm PSORT, the 

35 protein was predicted to be a type IIIA plasma membrane seven pass transmembrane 
protein. A genomic clone was identified in the Genbank database which contained the 
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predicted N-terminal 58 amino acids missing from the amino acid sequence encoded by 
SEQ ID NO: 372. The determined full-length cDNA sequence for the 128T1 clone is 
provided in SEQ ID NO: 390, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 391. 
5 The expression levels of certain of the isolated antigens in lung tumor 

tissues compared to expression levels in normal tissues was determined by microarray 
technology. The results of these studies are shown below in Table 3, together with the 
databank analyses for these sequences. 

10 Table 3 



Clone 


SEQ ID 
NO: 


Description 


LT+ 
F/N 


SCC+ 
M/N 


Squa/N 


Adeno/ 
N 


DMS79- 
Tl 


363 


STAT-ind inhib of 
cytokine 




2.0 


- 




DMS79- 
T6 


367 


Neuronal cell death 
related 




2.2 






DMS79- 
T9 


369 


Novel 




2.2 






DMS79- 
T10 


370 


Ubiquitin carrier 
protein 




3.9 


2.2 




DMS79- 
Tll 


371 


HPV16E1 pro binding 
protein 




2.1 






128-T9 


378 


Elongation factor 1 
alpha 




2.7 






128T11 


380 


Malate dehyrogenase 




2.3 


2.0 




128-T12 


381 


Apurinic/apyrim 
endonuclease 




5.4 






NCIH69 
-T3 


382 


Sm-like protein CaSm 






2.4 




NCIH69 
-T6 


384 


Transcription factor 
BTF3a 




2.5 







LT+F/N = Lung Tumor plus Fetal tissue over Normal tissues 
SC+M/N = Lung Small Cell carcinoma plus Metastatic over Normal tissues 
Squa/N = Squamous lung tumor over Normal tissues 
15 Aden/N = Adenocarcinoma over Normal tissues 
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Example 7 
Synthesis of Polypeptides 
Polypeptides may be synthesized on a Perkin Elmer/ Applied Biosystems 
Division 43 OA peptide synthesizer using FMOC chemistry with HPTU (0- 
5 Benzotriazole-NjNjN'jN'-tetramethyluronium hexafluorophosphate) activation. A Gly- 
Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a 
method of conjugation, binding to an immobilized surface, or labeling of the peptide. 
Cleavage of the peptides from the solid support may be carried out using the following 
cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol 
10 (40:1:2:2:3). After cleaving for 2 hours, the peptides may be precipitated in cold 
methyl-t-butyl-ether. The peptide pellets may then be dissolved in water containing 
0.1% trifluoroacetic acid (TFA) and lyophilized prior to purification by CI 8 reverse 
phase HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water 
(containing 0.1% TFA) may be used to elute the peptides. Following lyophilization of 
15 the pure fractions, the peptides may be characterized using electrospray or other types of 
mass spectrometry and by amino acid analysis. 

Example 8 

Isolation and Characterization of DNA Sequences Encoding Lung 
20 Tumor Antigens by T-Cell Expression Cloning 

Lung tumor antigens may also be identified by T cell expression cloning. 
One source of tumor specific T cells is from surgically excised tumors from human 
patients. 

A non-small cell lung carcinoma was minced and enzymatically digested 
25 for several hours to release tumor cells and infiltrating lymphocytes (tumor infiltrating T 
cells, or TILs). The cells were washed in HBSS buffer and passed over a Ficoll 
(100%/75%/HBSS) discontinuous gradient to separate tumor cells and lymphocytes 
from non- viable cells. Two bands were harvested from the interfaces; the upper band at 
the 75%/HBSS interface contained predominantly tumor cells, while the lower band at 
30 the 100%/75%/HBSS interface contained a majority of lymphocytes. The TILs were 
expanded in culture, either in 24-well plates with culture media supplemented with 10 
ng/ml IL-7 and 100 U/ml IL-2, or alternatively, 24-well plates that have been pre-coated 
with the anti-CD3 monoclonal antibody OKT3. The resulting TIL cultures were 
analyzed by FACS to confirm that a high percentage were CD8+ T cells (>90% of gated 
35 population) with only a small percentage of CD4+ cells. 
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In addition, non-small cell lung carcinoma cells were expanded in culture 
using standard techniques to establish a tumor cell line (referred to as LT391-06), which 
was later confirmed to be a lung carcinoma cell line by immunohistochemical analysis. 
This tumor cell line was transduced with a retroviral vector to express human CD80, 
5 and characterized by FACS analysis to confirm high expression levels of CD80, class I 
MHC and class II MHC molecules. 

The ability of the TIL lines to specifically recognize autologous lung 
tumor was demonstrated by cytokine release assays (IFN-y and TNF-a) as well as 51 Cr 
release assays. Briefly, TIL cells from day 21 cultures were co-cultured with either 

10 autologous or allogeneic tumor cells, EBV-immortalized LCL, or control cell lines 
Daudi and K562, and the culture supernatant monitored by ELISA for the presence of 
cytokines. The TIL specifically recognized autologous tumor but not allogeneic tumor. 
In addition, there was no recognition of EBV-immortalized LCL or the control cell 
lines, indicating that the TIL lines are tumor specific and are potentially recognizing a 

1 5 tumor antigen presented by autologous MHC molecules. 

The characterized tumor-specific TIL lines were expanded to suitable 
numbers for T cell expression cloning using soluble anti-CD3 antibody in culture with 
irradiated EBV transformed LCLs and PBL feeder cells in the presence of 20 U/ml IL-2. 
Clones from the expanded TIL lines were generated by standard limiting dilution 

20 techniques. Specifically, TIL cells were seeded at 0.5 cells/well in a 96-well U bottom 
plate and stimulated with CD-80-transduced autologous tumor cells, EBV transformed 
LCL, and PBL feeder cells in the presence of 50 U/ml IL-2. The specificity of these 
clones for autologous tumor was confirmed by 51 Cr microcytotoxicity and IFN-y 
bioassays. 

25 These CTL clones were demonstrated to be HLA-B/C restricted by 

antibody blocking experiments. A representative CTL clone was tested on a panel of 
allogeneic lung carcinomas and it recognized both autologous tumor and a lung 
squamous cell carcinoma (936T). As the only class I MHC molecule shared among 
these tumors was HLA-Cwl203, this indicated that this was the restriction element used 

30 by the CTL. This finding was confirmed by the recognition of a number of allogeneic 
lung carcinomas transduced with a retroviral vector encoding HLA-Cwl203 by the 
CTL. 

PolyA mRNA was prepared from a lung tumor cell line referred to as 
LT391-06 using Message Maker (Life Technologies; Rockville, MD). The subsequent 
35 steps involving cDNA synthesis were performed according to Life Technologies 
cloning manual (Superscript Plasmid System for cDNA Synthesis and Plasmid 
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Cloning). Modifications to the protocol were made as follows. At the adapter addition 
step, EcoRI-XmnI adapters (New England Biolabs; Beverly, MA) were substituted. Size 
fractionated cDNAs were ligated into the expression vector system HisMax A, B, C 
(Invitrogen; Carlsbad, CA) to optimize for protein expression in all three coding frames. 
5 Library plasmids were then aliquotted at approximately 100 CFU/well into a 96-well 
block for overnight liquid amplification. From these cultures, glycerol stocks were made 
and pooled plasmid was prepared by automated robot (Qiagen; Valencia, CA). The 
concentration of the plasmid DNA in each well of the library plates was determined to 
be approximately 150 ng/ul. Initial characterization of the cDNA expression library was 
10 performed by randomly sequencing 24 primary transformants and subjecting the 
resulting sequences to BLAST searches against available databases. The determined 
cDNA sequences are provided in SEQ ID NO: 443-480, with the results of the BLAST 
searches being provided in Table 4. 

15 Table 4 



Clone 


SEQ ID NO: 


GenBank 
Accession 


Description 


55163 


458, 459 




Novel in Genbank 


55158 


452 




Novel in Genbank 


Homology to known sequences with unknown function 


55153 


443, 444 


7018516 


H. sapiens mRNA; cDNA 
DKFZp434M035 


55154 


445, 446 


6437562 


H. sapiens Chr 22ql 1 PAC Clone p393 


55157 


450, 451 


2887408 


H. sapiens KIAA0417 mRNA 


55165 


462, 463 


3970871 


H. sapiens HRIHFB2122 mRNA 


Homology to known sequences with known function 


55155 


447 


7677405 


H. sapiens F-box protein FBS (FBS) 


55156 


448, 449 


3929584 


H. sapiens EEN pseudogene 


55161 


454, 455 


4503350 


H. sapiens DNA (cytosine-5-)- 
methyltransferase 1 (DNMT1) 


55162 


456, 457 


31220 


ERK1 mRNA for protein serine/threonine 
kinase 


55164 


460, 461 


6677666 


H. sapiens RNA-binding protein 
(autoantigenic) (RALY) 


55166 


464, 465 


3249540 


H. sapiens ribonuclease P protein subunit 
p40 (RPP40) 


55167 


466, 467 


7657497 


H. sapiens renal tumor antigen (RAGE) 


55168 


468, 469 


2873376 


H. sapiens exportin t mRNA 


55169 


470, 471 


3135472 


H. sapiens Cre binding protein-like 2 
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Clone 


SEQ ID NO: 


GenBank 
Accession 


Description 








mRNA 


55171 


474 


4759151 


H. sapiens spermine synthase (SMS) 


55173 


476 


6688148 


H. sapiens partial mRNA for NICE-3 
protein 


55174 


477 478 


531394 


Human, transcriptional coactivator PC4 


55175 


479 


6563201 


H. sapiens translation initiation factor elF- 
2b delta subunit 


55176 


480 


29860 


hCENP-Bgene, for centromere autoantigen 
B (CENP-B) 


Homology to Ribosomal Protein 


55159 


453 


337494 


Ribosomal protein L7a (surf 3) large 
subunit mRNA * 


55170 


472, 473 


4506648 


H. sapiens mRNA for ribosomal protein L3 


55172 


475 


388031 


H. sapiens ribosomal protein LI 1 



For T cell screening, approximately 80 ng of the library plasmid DNA 
and 80 ng of HLA-Cwl203 plasmid DNA was mixed with the lipid Fugene according 
to the manufacturers' instructions and transfected in duplicate into COS-7 cells. After 
5 incubation at 37 °C for 48 hours, the transfection mixture was removed and 10,000 
LT391-06 CTL were added to each well in fresh media containing human serum. 

The ability of T cells to recognize an antigen in the library was assessed 
by cytokine release after 6 hours (TNF-alpha, WEHI bio-assay) or after 24 hours (IFN- 
gamma, ELISA). Approximately 2.0 x 10 s clones (in plasmid pools of 100) were 

10 screened using this system in COS-7 cells. Three plasmid pools were identified 
(referred to as 14F10, 19A4, and 20E10) that were recognized by LT391-06 CTL. 
Transfection of these plasmid pools into COS-7 cells led to production of both IFN- 
gamma and TNF-alpha from the LT391-06 CTL at levels significantly above 
background. Pools 14F10, 19A4 and 20E10 were "broken down" into several hundred 

15 individual plasmid DNAs and retested. The sequences of 24 novel clones isolated from 
pool 14F10 are provided in SEQ ID NO: 481-51 1 . 

One plasmid (3D9) from pool 14F10, one plasmid from pool 20E10 and 
5 plasmids (2A6, 2E11, 2F12, 3F4, 3H8) from pool 19A4 were capable of 
reconstituting T cell recognition. Sequencing of these plasmids led to the identification 

20 of a 7.8 kB cDNA insert (referred to as clone 14F10), a 2.2 kB cDNA insert (referred to 
as clone 19A4; SEQ ID NO:440), and a clone referred to as 20E10. The full-length 
cDNA sequence for 14F10 is provided in SEQ ID NO: 441. Clone 14F10 does not 
contain the first two "G" nucleotides found at the 5' end of 19A4, and the 3 '-proximal 
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24 bp of 19A4 differ from the corresponding region of 14F10 (nucleotides 2145-2165). 
Furthermore, 3837 bp of 3' additional sequence was isolated for clone 14F10. The 5' 
terminal cDNA sequence (337 bp) of clone 20E10 is provided in SEQ ID NO: 442. 
20E10 contains an additional 3 nucleotides (as compared to 19A4) at the 5 '-most end. 
5 The additional sequence from the 5' end of clone 20E10 contains an "ATG" and 
therefore appears to contain the translational start site of a novel open reading frame. 
BLAST search analysis against the GenBank database identified these sequences as 
having significant homology with a truncated human cystine/glutamate transporter gene. 
Unlike the published sequence, however, clones 14F10 and 19A4 contain a unique 5' 

10 terminus consisting of 181 nucleotides. This novel sequence replaces the published 5' 
region and results in the removal of the reported initiating methionine (start codon) and 
an additional two amino acids of the reported transporter protein. Therefore, the 
translated product of clones 14F10 and 19A4 is different than the cystine/glutamate 
transporter protein. Furthermore, T cell recognition of other lung tumors demonstrates 

15 that this antigen is expressed by other tumors as well. 

The epitope and amino acid sequence encoded within clones 19A4 and 
14F10 which reconstitutes T cell recognition of anti-LT391-06 cells were mapped as 
follows. Cos-7 cells were transfected with 80 ng/well HLA-Cwl203 along with titrated 
amounts of cDNA encoding clone 19A4, a potential open reading frame located in the 

20 unique 5' terminus of 19A4, or the open reading frame from the cystine/glutamate (Cys- 
Glu) transporter gene, cloned into a eukaryotic expression vector and tested for 
stimulation of anti-LT391-06 T cells in a TNF assay. As a positive control Cos-7 cells 
were co-transfected with HLA-Cwl203 and the positive plasmid clone 19A4 described 
above. The Cys-Glu transporter expression construct was isolated by PCR using 5' and 

25 3' primers specific for the known ORF of the transporter with 19A4 as template. In 
addition, each 5' primer contained a Kozak translation initiation site and starting 
methionine to drive translation of the polypeptide. CTL against LT391-06 did not 
recognize transfectants expressing the Cys-Glu transporter construct, but did recognize 
transfectants expressing 19A4 and the 5' ORF from 19A4. 

30 In subsequent experiments, Cos-7 cells were co-transfected with 80 

ng/well HLA-Cwl203 along with titrated amounts of DNA of transposition mutants 
F10 and CI 2, respectively, and tested for simulation of anti-LT391-06 T cells in a TNF 
assay. As a positive control, Cos-7 cells were co-transfected with HLA-Cwl203 and 
clones of the 5' ORF of 19A4. Transposition mutants F10 and C12 were obtained by 

35 transposon-mediated mutation of the 14F10 clone and screening for insertion site by 
sequence analyses. The transposon of mutant F10 is inserted approximately 304 bp 
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from the 5' EcoRI cloning site of the 14F10 cDNA. This mutation did not disrupt 
translation of the T cell epitope. By contrast, the transposon of mutant CI 2, which is 
inserted approximately 116 bp from the 5' EcoRI cloning site of the 14F10 cDNA, was 
found to interrupt translation of the T cell eptiope. Thus the epitope in 14F10 maps 
5 between these two transposon insertion sites. The amino acid sequence of the region 
between the C12 and F10 transposon insertion sites is provided in SEQ ID NO: 586. 

A series of 11 overlapping 16-mer and 15-mer peptides for the region 
shown in SEQ ID NO: 586 were prepared and tested for stimulation of anti-LT391-06 
cells, as determined by cytokine release in TNF and IFN-y assays. Only the peptide 
10 provided in SEQ ID NO: 587 (corresponding to residues 5-20 of SEQ ID NO: 586) 
stimulated cytokine release. These studies demonstrate that the HLA-Cwl203 restricted 
epitope of the LT391-06 antigen is contained within SEQ ID NO: 587. 

Example 9 

1 5 Isolation and Characterization of DNA Sequences Encoding Lung 

Tumor Antigens by PCR Subtraction 

This example describes the isolation and characterization of cDNA 
clones from a PCR subtracted expression library prepared from the human lung tumor 
cell line LT391-06 described above. 

20 Tester poly A mRNA was prepared from the cell line LT391-06 as 

described above. Driver poly A mRNA was isolated from a human acute T cell 
leukemia/T lymphocyte cell line (Jurkat) which is derived from non-lung cells and is 
not recognized by LT391-06 reactive T cells. The subtraction was performed according 
to the method of Clontech (Palo Alto, CA) with the following changes: 1) a second 

25 restriction digestion reaction of cDNA was completed using a pool of enzymes (MscI, 
PvuII, StuI and Dral). This was in addition to, and separate from, the Clontech 
recommended single restriction enzyme digestion with Rsal. Each restriction digest set 
was treated as a separate library to ensure that the final mixed library contained 
overlapping fragments. Thus, the epitope recognized by the T cells should be 

30 represented on a fragment within the library and not destroyed by the presence of a 
single restriction site within it. 2) The ratio of driver to tester cDNA was increased in 
the hybridization steps to increase subtraction stringency. To analyze the efficiency of 
the subtraction, actin was PCR amplified from dilutions of subtracted, as well as 
unsubtracted, PCR samples. The second amplification step utilized primers that were 

35 modified from those normally used. Three nested PCR primers were engineered to 
contain a cleavable EcoRI site (not utilized during cloning) that was in one of three 
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frames. Thus, secondary amplification with these primers resulted in products that 
could be ligated directly into the eukaryotic expression plasmid pcDNA4His/Max-Topo 
(Invitrogen). This resulted in the PCR subtracted and amplified fragments being 
represented in-frame somewhere within the library. Due to the mechanics of the 
5 subtraction only 50% of fragments will be in the correct orientation. The complexity 
and redundancy of the library was characterized by sequencing 96 randomly picked 
clones from the final pooled PCR subtraction expression library, referred to as LT391- 
06PCR. These sequences (SEQ ID NO: 512-581) were analyzed by comparison to 
sequences in publicly available databases (Table 5). 

10 

Tables 



Clone 


SEQ ID 
NO: 


GenBank 
Accession 


Description 


57235 


532 




Novel in Genbank 


57255 


547 




Novel in Genbank 


57264 


554 




Novel in Genbank 


Homology to known sequences with unknown function 


57215 


518 


5689540 


H. sapiens mRNA for KIAA1 102 protein 


57223 


522 


2341006 


Human Xq 13 3' end of PAC 92E23 


57227 


524 


7022540 


H. sapiens cDNA FLJ10480 fis, clone 
NT2RP2000126 


57238 


535 


6807795 


H. sapiens mRNA; cDNA 
DKFZp761G02121 


57239 


536 


5757546 


H. sapiens clone DJ0823F17 


57243 


539 


7023805 


H. sapiens cDNA FLJ1 1259 fis, clone 
PLACE1009045 


57245 


540 


4884472 


H. sapiens mRNA; cDNA 
DKFZp58602223 


57267 


557 


6808218 


H. sapiens mRNA; cDNA 
DKFZp43401519 


57268 


558 


10040400 


Sequence 12 from Patent WO9954460 


57270 


560 


7959775 


H. sapiens PR01489 mRNA 


57271 


561 


4500158 


H. sapiens mRNA; cDNA 
DKFZp586B0918 


57281 


567 


6560920 


H. sapiens clone RP1 1- 501O7 


57283 


569 


285962 


Human mRNA for KIAA0108 gene 


57285 


570 


7019813 


H. sapiens cDNA FLJ20002 fis, clone 
ADKA01577 


Homology to known sequences with known function 


57207 | 512 | 517176 | H. sapiens YAP65 mRNA 
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Clone 


SEQ ID 
NO: 


GenBank 
Accession 


Description 


57210 


514 


6841233 


H. sapiens HSPC292 mRNA 


57211 


515 


2606093 


H. sapiens Cyr61 protein (CYR61) mRNA 


57212 


516 


339648 


Human thioredoxin (TXN) mRNA 


57219 


519 


4504616 


H. sapiens insulin- like growth factor 
binding protein 3 (IGFBP3) 


57221 


520 


7274241 


H sapiens novel retinal pigment epithelial 
cell protein (NORPEG) 


57222 


521 


189564 


Hurn.HH plasminogen activator inhibitor- 1 
gene 


57228 


525 


4757755 


TT <35mi£M*iQ annf^vin A 9 ( A7sT5TA9^ 

ri. o dpi Clio alJLllC.AJ.ll r\z, \jr\l\j\Jr\^) 


57230 


527 


180800 


Human alpha- 1 collagen type IV gene, 


57232 


529 


6729061 


H. sapiens clone RPC1 1- 98D12 from 
7q31 


57233 


530 


338391 


Spermidine/ spermine Nl- 
acetyltransferase 


57234 


531 


7305302 


H. sapiens NCK- associated protein 1 
(NCKAP1) 


57236 


533 


4929722 


H sapiens CGI- 127 protein 


57242 


538 


4503558 


H. sapiens epithelial membrane protein 1 
(EMP1) 


57248 


541 


183585 


Human pregnancy- specific beta- 
glycoprotein c 


57250 


543 


4759283 


H. sapiens ubiquitin carboxyl- terminal 

CbLclaac 1^1 v^JTiJLvl j 


57251 


544 


1236321 


Human laminin gamma2 chain gene 
rr a Mr? » 


57253 


545 


213831 


H sapiens lysyl hydroxylase isoform 2 
(PLOD2) 


57254 


546 


536897 


Human follistatin- related protein 
precursor mRNA 


57257 


548 


339656 


Human endothelial cell thrombomodulin 


57258 


549 


190467 


Human prion protein (PrP) mRNA 


57261 


551 


338031 


Human serglycin gene 


57262 


552 


178430 


Human alphoid DNA (alphoid repetitive 
sequenc e) 


57265 


555 


4502562 


H. sapiens calpain, large polypeptide L2 
(CAPN2) 


57266 


556 


398163 


H. sapiens mRNA for insulin- like growth 
factor binding protein- 3 


57269 


559 


7262375 


H. carboxylesterase 2 (intestine, liver) 
(CES2) 


57272 


562 


467560 


H. sapiens mRNA for cysteine 
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Clone 


SEQID 
NO: 


GenBank 
Accession 


Description 








dioxygenase type 1 


57274 


563 


482664 


H. sapiens annexin A3 (ANXA3) 


57275 


564 


2281904 


H. sapiens Bruton's tyr. kinase (BTK), 
alpha~ D~ galactosidase A (GLA) 


57277 


565 


4557498 


H. sapiens C- terminal binding protein 2 . 
(CTBP2) 


57282 


568 


1 89245 


Human, NAD( P) H* menadione 
oxidoreductase mRNA 


57287 


571 


28525 


Human mRNA for amyloid A4 precursor 
of Alzheimer's disease 


57288 


572 


4757755 


H. sapiens annexin A2 (ANXA2) 


57289 


573 


5729841 


H. sapiens glyoxalase I (GLOl) mRNA 


57290 


574 


6103642 


H. sapiens F- box protein FBX3 mRNA 


57295 


576 


182513 


Human ferritin L chain mRNA 


57299 


579 


37137 


Human mRNA for thrombospondin 


57301 


580 


179682 


Human (clone A 12) C4b- binding protein 
bctci- chsiii 


57302 


581 




H. Sctpiens membrane inet&Ilo - 
endopeptici&se (neutr&l 
cndop6ptid.&se ? enkephalina.se 5 CALLA, 
CD10)(MME) 


57213 


517 


2665791 


H. sapiens caveolin- 2 mRNA 


57259 


550 


2665791 


H. sapiens caveolin- 2 mRNA 


57225 


523 


179765 


Human calcyclin gene 


57229 


526 


179765 


Human calcyclin gene 


57237 


534 


186962 


Human laminin B2 chain gene 


57249 


542 


186962 


Human laminin B2 chain gene 


57231 


528 


4972626 


H. sapiens caveolin 1 (CAV1) gene 


57296 


577 


4972626 


H. sapiens caveolin 1 (CAV1) gene 


57297 


578 


4972626 


H. sapiens caveolin 1 (CAV1) gene 


57240 


537 


266237 


insulin- like growth factor binding protein 
3 


57292 


575 


184522 


Human insulin- like growth factor- 
binding protein- 3 gene 


57263 


553 


4504618 


H. sapiens insulin- like growth factor 
binding protein 7 (IGFBP7) 


57280 


566 


4504618 


H. sapiens insulin- like growth factor 
binding protein 7 (IGFBP7) 


Homology to Ribosomal Protein 


57209 


513 


337504 


Human ribosomal protein S24 mRNA 
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Example 10 

Isolation and Characterization of T Cell Receptors from T Cell Clones 
Specific for Lung Tumor Antigens 

This example describes the cloning and sequencing of T cell receptor 
5 (TCR) alpha and beta chains from a CD8 T cell clone specific for an antigen expressed 
by the lung tumor cell line LT391-06. T cells have a limited lifespan. Cloning of TCR 
chains and subsequent transfer would essentially enable infinite propagation of the T 
cell specificity. Cloning of tumor antigen TCR chains allows the transfer of the 
specificity into T cells isolated from patients that share TCR MHC-restricting alleles. 

1 0 Such T cells can then be expanded and used in adoptive transfer techniques to introduce 
the tumor antigen specificity into patients carrying tumors that express the antigen (see, 
for example, Clay et al. J. Immunol. 163:507 (1999)). 

Cytotoxic T lymphocyte (CTL) clones specific for the lung tumor cell 
line LT391-06 were generated. Total mRNA from 2 x 10 6 cells from 15 such clones 

15 was isolated using Trizol reagent and cDNA was synthesized using Ready-to-Go kits 
(Pharmacia). To determine Va and Vb sequences in these clones, a panel of Va and Vb 
subtype-specific primers was synthesized and used in RT-PCR reactions with cDNA 
generated from each of the clones. The RT-PCR reactions demonstrated that each of 
the clones expressed a common Vb sequence that corresponded to the Vbl3 subfamily. 

20 Using cDNA generated from one of the clones (referred to as 1105), the Va sequence 
expressed was determined to be Va22. To clone the full TCR alpha and beta chains 
from clone 1105, primers were designed that spanned the initiator and terminator- 
coding TCR nucleotides. Standard 35-cycle RT-PCR reactions were established using 
cDNA synthesized from the CTL clone and the primers, with PWO (BMB) as the 

25 thermostable polymerase. The resultant specific bands (approximately 850 bp for the 
alpha chain and approximately 950 bp for the beta chain) were ligated into the PGR 
blunt vector (Invitrogen) and transformed into E. coli. E. coli transformed with 
plasmids containing the full-length alpha and beta chains were identified, and large 
scale preparations of the corresponding plasmids were generated. Plasmids containing 

30 full-length TCR alpha and beta chains were sequenced. The determined cDNA 
sequences for the alpha and beta chains are provided in SEQ ID NO: 583 and 582, 
respectively, with the corresponding amino acid sequences being provided in SEQ ID 
NO: 584 and 585, respectively. 



117 



WO 02/092001 



PCT7US02/14975 



Example 1 1 

Cloning of cDNAs Encoding Lung Small Cell Carcinoma Antigens 

Lung small cell carcinoma antigens were cloned by screening a small cell 
cDNA expression library with a mouse anti-SCID mouse serum. This antiserum was 
5 developed by growing lung small cell carcinoma cell lines NCIH69 and NCIH128 in 
SCID mice, removing SCID serum containing shed and secreted tumor antigens and 
immunizing normal mice with this serum. The library was constructed with mRNA 
from cell line NCIH128 in the lambda ZAP Express expression vector (Stratagene). 
The antiserum was adsorbed with E. coli lysate and human GAPDH protein and Ku 

10 autoantigens, and human PBMC lysate was added to the serum to block antibody to 
proteins found in normal tissue. 

Sixty clones were isolated and the inserts of these clones were 
sequenced. The isolated clones and their respective sequence and clone identifiers are 
presented in Tables 6 and 7. The isolated clone sequences were compared to sequences 

15 in publically available databases. A summary of the Genbank homologies is found in 
Tables 6 and 7. Those showing some degree of similarity with known sequences are 
described in Table 6, while those showing little or no similarity with known sequences 
are described in Table 7. 

20 Table 6 



SEQID 
NO:. 


CLONE 
ID# 


Genbank Homologies 


589 


54534 


Homo sapiens mRNA for LAK-1 


590 


54536 


Homo sapiens CGI- 108 protein mRNA 


591 


54538 


Human mRNA for HHR23 A protein 


592 


54540 


Homo sapiens chromosome 17, clone hRPC. 
1030 0 14 


593 


55084 


Homo sapiens homolog of rat elongation factor pi 8 
(pl8) 


594 


55086 


Homo sapiens HSPC194 mRNA 


595 


54555 


Homo sapiens accessory proteins BAP31/BAP29 
(DXS1357E)mRNA 


596 


54557 


Homo sapiens mesenchymal stem cell protein 
DSCD75 mRNA 


597 


54564 


Homo sapiens prp28, U5 snRNP 100 kd protein (U5- 
100K) mRNA 


599 


55473 


Homo sapiens uroporphyrinogen III synthase 
(congenital erythropoietic porphyria) (UROS 


600 


55104 


Homo sapiens carbonyl reductase (LOC51 181) 


601 


55105 


Homo sapiens membrane component, chromosome 11, 
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SEQID 
NO:. 


CLONE 
ID# 


Genbank Homologies 






surface marker 1 (M 1 1 S 1 ) 


602 


55107 


H. sapiens mRNA encoding GPI-anchored protein 
pl37 


604 


55114 


Homo sapiens mRNA; cDNA DKFZp56401716 


605 


55477 


H.sapiens YB-1 gene promoter region 


606 


55482 


Homo sapiens mRNA ; cDNA DKFZp434B0425 


607 


55483 


Human Gu protein mRNA 


608 


55485 


Homo sapiens 45kDa splicing factor mRNA 


609 


55487 


Homo sapiens genomic DNA, chromosome 21c[, 
section 72/105 


610 


55488 


Homo sapiens chromosome 17, clone hCIT529110 


612 


55089 


Homo sapiens scaffold attachment factor A (SAF~A) 
mRNA 


613 


55092 


Homo sapiens density regulated protein drpl mRNA 


614 


55093 


H.sapiens mRNA encoding GPI-anchored protein 
pl37 


615 


56926 


Homo sapiens high-mobility group (nonhistone 


617 


56944 


Mnmn QfmipriQ T<fR~NJ A-9 rr»-nrtivfltnr (\ OOVT^ (r\\ 0(Y\ 
.TJ.UIIJLU odpicllo JVDIN/A.-^, UU-dL/HVaLUI ^IUUjvJ-/^ \JJ X\J\JJ S 

mRNA 


619 


55490 


Homo sapiens death-associated protein 6 (DAXX) 
mRNA and translated products 


620 


55495 


Homo sapiens mRNA for ^1VIEGF6 


621 


55504 


Mus musculus hairy / enhancer of split 6 mRNA 


624 


56482 


H.sapiens DNA from chromosome 19-cosmids 
R31158, R31874, & R28125, genomic seq. 


626 


56487 


Human L23 mRNA for putative ribosomal protein 


627 


56488 


Homo sapiens cDNA FLJ10526 fis, clone 
NT2RP2000931, highly similar to MATRIN 3 


628 


56490 


Homo sapiens Su.ll isolog mRNA 


630 


56494 


Homo sapiens mRNA; cDNA DKFZp564B167 (from 
clone DKPZp564B 167) 


631 


56495 


Homo sapiens 12pl3.3 BAC RPC11 1-543P15 
(Roswell Park Cancer Inst. Human BAC lib.) 


632 


56499 


Human DNA-binding protein B (dbpB) gene, 3' end 


633 


56517 


Homo sapiens esterase D mRNA 


634 


56952 


Homo sapiens 14q32 Jagged2 gene, complete cds; and 

U1JJSJ.1U Wli gciic 


635 


56953 


Homo sapiens DNA polymerase zeta catalytic subunit 
(REV3L) mRNA 


637 


57139 


Homo sapiens ribosomal protein, large, PO (RPLPO) 
mRNA 


638 


57078 


Homo sapiens alpha-tubulin isoform 1 mRNA 



119 



WO 02/092001 



PCT7US02/14975 



SEQID 
NO:. 


CLONE 
ID# 


Genbank Homologies 


640 


57099 


Homo s&picns unchsrs-ctcrizcd. hypotli&l^xiius protein 
HBEX2 mRNA 


642 


57105 


Homo sapiens splicing factor, arginine/serine-rich 7 

n <! VTA f <?FR 97^ 

JisJ-J ) ^OPIvO / ) 


643 


57111 


Human chromosome 14 DNA sequence 


644 


57117 


Human DNA sequence from cosmid V857G56, 

DcLWCCll IIlalKcrb UAojDO duil JJAoo / On 

chromosome X contains ESTs 


645 


57121 


oncogene of hepatocellular colorectal and non-small 
cell lung cancer, segment 3/11 


646 


57124 


H.sapiens MLN50 mRNA 


647 


57125 


Homo sapiens calreticulin (CALR) , mRNA 


Table 7 


SEQID 
NO:. 


CLONE 
ID# 


Genbank Homologies 


588 


54533 


Novel 


598 


55098 


Novel 


603 


55108 


Novel 


611 


55087 


Novel (partial overlap of Unknown: Homo sapiens 
partial mRNA, clone cl-10el6) 


616 


56930 


Novel 


618 


56945 


Novel 


622 


55506 


Novel / (136bp: Mus musculus mRNA for Rab24 
protein) 


623 


56480 


Novel 


625 


56484 


Novel 


629 


56493 


Novel 


636 


56959 


Novel 


639 


57092 


Novel 


641 


57100 


Novel (last 120 bp: Unknown: Canine 21 kDa Signal 
peptase subunit mRNA) 



In further studies, the expression levels of certain of these disclosed 
5 isolated antigens were compared to the expression levels in 36 normal tissues using 
microarray technology and computer analysis. These sequences were arrayed on Chip 
#7. The results of these studies are shown below in Table 8. 
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Table 8 



Clone 
Name 


Clone ID # 


SEQ ID NO: 


Squa/N 


Aden/N 


SC/N 


LSCC2-1 


54533 


588 


3 


2 


1 


LSCC2-2 


54534 


589 


5 


3 


5 


LSCC2-4 


54536 


590 


3 


2 


2 


LSCC2-8 


54540 


592 


0 


3 


2 


LSCC2-18 


55084 


593 


2 


2 


1 


LSCC2-23 


54555 


595 


2 


3 


3 


LSCC2-25 


54557 


596 


2 


1 


1 


LSCC2-32 


54564 


597 


2 


3 


2 


LSCC2-48 


55473 


599 


4 


2 


1 


LSCC2-58 


55104 


600 


3 


5 


2 


LSCC2-61 


55107 


602 


2 


5 


3 


LSCC2-75 


55483 


607 


2 


4 


2 


LSCC2-79 


55487 


609 


3 


2 


2 


LSCC2-93 


55089 


612 


5 


4 


4 


LSCC2-121 


55490 


619 


4 


2 


2 


LSCC2-127 


55495 


620 


2 


4 


1 


LSCC2-137 


55504 


621 


0 


3 


8 


LSCC2-139 


55506 


622 


3 


4 


1 


LSCC2-161 


56480 


623 


3 


2 


1 


LSCC2-164 


56482 


624 


2 


4 


2 


LSCC2-171 


56488 


627 


6 


4 


5 


LSCC2-178 


56494 


670 


3 


5 


3 


LSCC2-191 


56517 


673 


5 


2 


2 



Squa/N = fold overexpression in squamous lung tumor versus normal tissues 
Aden/N = fold overexpression in adenocarcinoma versus normal tissues 
5 SC/N = fold overexpression in lung small cell carcinoma versus normal tissues 



Example 12 

Use of Mouse Antisera to Identify cDNA Sequences Encoding Lung Small Cell 
10 Carcinoma Antigens 

This example illustrates the isolation of cDNA sequences encoding lung 
small cell carcinoma antigens by screening a small cell carcinoma cell line cDNA 
library with mouse anti-SCID mouse sera. 

A directional cDNA expression library was constructed with mRNA 
15 from small cell carcinoma cell line NCIH128 employing the Lambda ZAP Express 
expression system (Stratagene, La Jolla, CA). Sera was obtained from SCID mice 
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containing human small cell carcinoma cell lines NCIH69 and NCIH128. The sera 
contains shed and secreted tumor antigens. These sera were pooled and injected into 
normal mice to produce anti-SCID mouse sera. The antiserum was absorbed with E. 
coli lysate, human GADPH protein and Ku autoantigens, and human PBMC lysate was 
5 added to the serum to block antibodies to proteins found in normal tissue. 

Thirty-nine clones were isolated and the inserts of these clones were 
sequenced. The isolated clones and their respective sequence and clone identifier are 
presented in Table 9. The clone sequences were compared to sequences in publicly 
available databases (Geneseq, GenBank and huESTdb). A summary of these 
10 comparisons are found in Tables 10 and 11. Those showing some degree of homology 
with known sequences are described in Table 10, while those showing little or no 
similarity to known sequences are described in Table 11. 



Table 9 



CLONE NAME 


SEQ. ID. NO: 


CLONE ID # 


LSCC-8 


648 


50664 


LSCC-13 


649 


50669 


LSCC-18 


650 


50673 


LSCC-25 


651 


50680 


LSCC-33 


652 


50685 


LSCC-47 


653 


50699 


LSCC-48 


654 


50700 


LSCC-50 


655 


50702 


LSCC-52 


656 


50704 


LSCC-58 


657 


50710 


LSCC-59 


658 


50711 


LSCC-67 


659 


50719 


LSCC-68 


660 


50720 


LSCC-73 


661 


50725 


LSCC-75 


662 


50727 


LSCC-77 


663 


50729 


LSCC-84 


664 


50736 


LSCC-86 


665 


50738 


LSCC-88 


666 


50740 


LSCC-89 


667 


50741 


LSCC-92 


668 


50744 


LSCC-93 


669 


50745 


LSCC-103 


670 


50754 


LSCC-105 


671 


50756 


LSCC-106 


672 


50757 


LSCC-110 


673 


50761 
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CLONE NAME 


SEQ. ID. NO: 


CLONE ID # 


LSCC-112 


674 


50763 


LSCC-116 


675 


50767 


LSCC-145 


676 


50775 


LSCC-146 


677 


50776 


LSCC-147 


678 


50777 


LSCC-156 


679 


50786 


LSCC-157 


680 


50787 


LSCC-159 


681 


50789 


LSCC-167 


682 


51003 


LSCC-171 


683 


51007 


LSCC-178 


684 


51014 


LSCC-207 


685 


51304 


LSCC-239 


686 


51568 



Table 10 



Seq. 
ID. No. 


GenBank 
(ACCESS. #) 


Description 


648 


D21094 


Human mRNA for motor protein 


652 


NM 004487 


Homo sapiens golgi autoantigen, golgin subfamily b, 
macro golgin w/transmembrane signal 


653 


J04031 


Human methylenetetrahydrofolate dehydrogenase- 
methenyltetrahydrofolatecyclohydrolase- 
formyltetrahydrofolate synthetase mRNA 


654 


MN 007086 


Homo sapiens AND-1 protein (AND-1), mRNA 


657 


J03483 


Human chromogranin A mRNA 


658 


AF191340 


Homo sapiens anaphase-promoting complex subunit 7 
(APC7) 


661 


AC020663 


Homo sapiens chromosome 1 6 clone RPC1-11 127120 


662 


D13388 


Human mRNA for DnaJ protein homolog 


663 


AB014540 


Homo sapiens mRNA for KIAA0640 protein, partial cds 


666 


NM_005898 


Homo sapiens membrane component, chromosome 1 1, 
surface marker 1 (Ml 1S1) 


667 


X75304 


H. sapiens giantin mRNA 


668 


Z29067 


H.sapiens AF-lp mRNA 


669 


AJ133129 


H.sapiens mRNA for small glutamine-rich tetratricopeptide 
repeat containing protein 


670 


AF058918 


Homo sapiens unknown mRNA 


671 


D89976 


H.sapiens mRNA for 5-aminoimidazole-4-carboxamide 
ribonucleotide transformylase 


672 


NM_001539 


Homo sapiens heat shock protein, DNAJ-like 2 (HS J2) 
mRNA 


673 


AC020663 


Homo sapiens chromosome 16 clone RPCI-1 1-127120 


674 


D21235 


Human mRNA for HHR23A protein 
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Seq. 
ID. No. 


GenBank 
(ACCESS. #) 


Description 


676 


MN 003804 


Homo sapiens receptor (TNFRSF)-interacting serine- 
threonine kinase 1 (RIPK1) 


677 


X76180 


H. sapiens inRNA for lung amiloride sensitive Na+ channel 
Protein 


678 


AB018330 


Homo sapiens mRNA for KIAA0787 protein, partial cds 


U87803 


Human putative ca2+/calmodulin-dependent protein kinase 
gene, 3 ' flanking region 


679 


L31610 


Homo sapiens (clone cori-lcl5) S29 ribosomal protein 
mRNA 


680 


Z83840 


Human DNA sequence from clone CTA-216E10 on 
chromosome 22 contains the NHP2L1 gene for non-histone 
chromosome protein 2 


682 


D14696 


Human mRNA for KIAA0108 gene 


683 


Z47087 


H. sapiens mRNA for RNA polymerase II elongation factor- 
like protein 


684 


Z83840 


Human DNA sequence from clone CTA-216E10 on 
chromosome 22 contains the NHP2L1 gene 


685 


U01923 


Human BTK region clone ftp-3 mRNA 



Table 11 



Seq. 
ID. 


GenBank 




No. 


(ACCESS. #) 


Description 


649 




Novel 


650 


AC005023 


Unknown: Homo sapiens BAC clone GS 1-42 113 from 
Xq25-q26 


651 




Novel 


655 


AC007199 


Unknown: Homo sapiens chromosome 5 BAC clone 
lllnl3 


656 


AC005988 


Unknown: Homo sapiens chromosome 17, clone 
hRPK.299 G 24 


659 


AK001695 


Unknown: Homo sapiens cDNA FLJ10833 fis, clone 
NT2RP4001206, moderately similar to Drosophila 
melanogaster strawberry notch mRNA 


660 


AK001722 


Unknown: Homo sapiens cDNA FLJ10860 fis, clone 
NT2RP4001568, weakly similar to ZINC FINGER 
PROTEIN GCS1 


664 


AK001925 


Unknown: Homo sapiens cDNA FLJ1 1063 fis, clone 
PLACE1004814, weakly similar to SPLICING FACTOR, 
ARGININE/SERINE-RICH 4 


665 




Novel 


675 


(AJ131096) 


Novel (1 to 103 bp is Picea abies microsatellite RNA), clone 
PAAG2 
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Seq. 
ID. 
No. 


GenBank 
(ACCESS. #) 


Description 


681 


AP001065 


Unknown: Homo sapiens genomic DNA, chromosome 21, 
clone:KB68A7, MX-D21S171 region 


686 




Novel 



In further studies, the expression levels of certain of these disclosed 
isolated antigens were compared to the expression levels in 36 normal tissues using 
microarray technology and computer analysis. These sequences were arrayed on Chip 
5 #7. The results of these studies are shown below in Table 12. 



Table 12 



Clone 
Name 


Clone lit ft 


btH2 iu syu. 


isqua/f\ 


Aden/IN 


SC/N 


LSCC-8 


50664 


648 


4 


3 


2 


LSCC-13 


50669 


649 


2 


4 


0 


LSCC-59 


50711 


658 


4 


2 


3 


LSCC-84 


50736 


664 


6 


3 


4 


LSCC-86 


50738 


665 


1 


4 


0 


LSCC-88 


50740 


666 


2 


3 


4 


LSCC-92 


50744 


668 


3 


1 


1 


LSCC-105 


50756 


671 


4 


3 


2 


LSCC-106 


50757 


672 


4 


3 


1 


LSCC-110 


50761 


673 


8 


3 


4 


LSCC-146 


50776 


677 


3 


1 


1 


LSCC-147 


50777 


678 


5 


2 


3 


LSCC-156 


50786 


679 


4 


2 


2 


LSCC-159 


50789 


681 


2 


2 


1 


LSCC-171 


51007 


683 


2 


1 


1 


LSCC-207 


51304 


685 


3 


4 


3 


LSCC-239 


51568 


686 


4 


3 


2 



Squa/N = Squamous lung tumor versus Normal tissues 
10 Aden/N = Adenocarcinoma over versus tissues 

SC/N = Lung Small Cell carcinoma versus Normal tissues 
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Example 13 

Use of Mouse Antisera to Identify cDNA Sequences Encoding Lung Small Cell 
Carcinoma Antigens 

This example illustrates the isolation of cDNA sequences encoding lung 
5 small cell carcinoma antigens by screening a small cell carcinoma cell line cDNA 
library with mouse anti-SCID mouse sera. 

A directional cDNA expression library was constructed with mRNA 
from a SCID-passaged human lung cancer tumor DMS79 employing the Lambda ZAP 
Express expression system (Stratagene, La Jolla, CA). Sera was obtained from SCID 

10 mice containing the human lung cancer tumors DMS79 and NCIH688. The sera 
contains shed and secreted tumor antigens. These sera were pooled and injected into 
normal mice to produce anti-SCID mouse sera. The antiserum was absorbed with E. 
coli lysate, human GADPH protein and Ku autoantigens, and human PBMC lysate was 
added to the serum to block antibodies to proteins found in normal tissue. 

15 Forty-one clones were isolated and the inserts of these clones were 

sequenced. The isolated clones and their respective sequence identifiers are presented 
in Table 13. The clone sequences were compared to sequences in publicly available 
databases. A summary of these comparisons are found in Tables 14 and 15. Those 
showing some degree of similarity with known sequences are described in Table 14, 

20 while those showing little or no similarity to known sequences are found in Table 15. 



Table 13 



CLONE NAME 


SEQ. ID. NO:. 


CLONE ID # 


DMS-3 


687 


48564 


DMS-8 


688 


48567 


DMS-9 


689 


48568 


DMS-12 


690 


48571 


DMS-14 


691 


45572 


DMS-25 


692 


48578 


DMS-3 5 


693 


48583 


DMS-3 8 


694 


48584 


DMS-39 


695 


48585 


DMS-47 


696 


49059 


DMS-50 


697 


49061 


DMS-53 


698 


49065 


DMS-61 


699 


49070 


DMS-63 


700 


49072 


DMS-64 


701 


49073 


DMS-67 


702 


49076 
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CLONE NAME 


SEQ.ID.NO:. 


CLONE ID # 


DMS-75 


703 


50793 


DMS-76 


704 


50794 


DMS-79 


705 


50797 


DMS-84 


706 


50800 


DMS-93 


707 


50805 


DMS-126 


708 


50984 


DMS-129 


709 


50986 


DMS-139 


710 


51065 


DMS-151 


711 


51070 


DMS-164 


712 


51078 


DMS-168 


713 


51080 


DMS-175 


714 


51084 


DMS-193 


715 


51095 


DMS-1 99 


716 


51099 


DMS-200 


717 


51100 


DMS-204 


718 


51103 


DMS-214 


719 


51112 


DMS-218 


720 


51113 


DMS-221 


721 


51116 


DMS-232 


722 


51123 


DMS-253 


723 


51212 


DMS-270 


724 


51220 


DMS-275 


725 


51224 


DMS-289 


726 


51234 


DMS-296 


727 


51239 



Table 14 


SEQIDNO: 


GenBank 


687 


KIAA0013:cDNA from Hu. BM myeloblast line 


688 


Hu. Homolog Mu. LLRep3, sim. To ribosomal S2 


689 


KIAA0769, Hu. brain protein 


690 


Hu. Thymidylate kinase (CDC9), regul'n 


691 


Hu. Ku automimmune Ag; Nuc. Fctr. IV 


692 


Hu. PolyubiquitinUbC 


693 


Hu. FLJ20423 fis (signet-ring cell care. Celline) 


694 


KIAA0640, SWAP-70 (Hu, brain protein) 


695 


Human radixin (cytosleletal) 


696 


Hu. Ribosomal protein LI 3a 


697 


Hu. trk oncogene, cytoskltl. Tropomyosin 


698 


DKFZp586K2 1 20 (uterus) KIAA0784 (brain) 


699 


Hu. Chromogranin A (parathyr. Secrtry. Pro. 1) 


700 


Hu. Glutathione- S -transferase homolog 


701 


Hu. lactate dehydrogenase-A 
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SEQ ID NO: 


GenBank 


702 


Hu.GPI-anchored membr. Pro. pi 37 


704 


Hu.HMG-17 


705 


Hu. Ubiquitin C-terminal hydrolase (UHX1) 


706 


Hu. Cosmid 25, PAC clone RP5-901A4 


707 


Hu. lactate dehydrogenase B 


708 


Hu. NuMA gene 


709 


Hu. KIAA0008 gene 


710 


Hu. BCL2/adenovirus E1B pro.2 (BNIP2) 


711 


Hu. Unactive progesterone receptor P23 


712 


Hu. alpha II spectrin 


713 


Hu. Transcriptional coactivator ALY 


714 


,Hu. DnaJ Heat Shock homolog 


715 


Hu. mitoch. Or Replication 


716 


Hu. Ornithine decarboxylase antizyme (brain) 


717 


Hu. Deoxycytidine kinase 


718 


Hu. Fumarase 


719 


Hu. 80K-H protein (kinase C substrate) 


721 


Hu. Neuro-d4 (rat) homolog 


722 


Hu. Sodium/glucose cotransporter, repeat 


724 


Hu. Zinc finger protein ZNF226 


725 


Hu. Jumping transloc'n brkpt. Gene 


726 


Hu. M-phase phosphoprotein 


727 


Hu. Peroxisomal signal receptor 1 



Table 15 



SEQ ID NO: 


GenBank 


703 


Novel 


720 


Novel (ALU?) 


723 


Novel 



5 Example 14 

Analysis of cDNA Expression using Microarray Technology 

In additional studies, four clones obtained in Example 13 were found to 
be overexpressed in specific tumor tissues as determined by microarray analysis. Using 
this approach, cDNA sequences are PCR amplified and their mRNA expression profiles 
10 in tumor and normal tissues were examined using cDNA microarray technology 
essentially as described (Shena et ah, 1995). In brief, the clones are arrayed onto glass 
slides as multiple replicas, with each location corresponding to a unique cDNA clone 
(as many as 5500 clones can be arrayed on a single slide, or chip). Each chip is 
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hybridized with a pair of cDNA probes that are fluorescence-labeled with Cy3 and Cy5, 
respectively. Typically, lug of polyA + RNA is used to generate each cDNA probe. 
After hybridization, the chips are scanned and the fluorescence intensity recorded for 
both Cy3 and Cy5 channels. There are multiple built-in quality control steps. First, the 
5 probe quality is generally monitored using a panel of ubiquitously expressed genes. 
Secondly, the control plate also can include yeast DNA fragments of which 
complementary RNA may be spiked into the probe synthesis for measuring the quality 
of the probe and the sensitivity of the analysis. Currently, the technology offers a 
sensitivity of about 1 in 100,000 copies of mRNA. Finally, the reproducibility of this 
10 technology can be ensured by including duplicated control cDNA elements at different 
locations. 

The extended predicted full length sequences for partial sequences of 
clones, DMS39, DMS126, DMS218 and DMS253 (previously isolated in Example 13) 
were obtained from the GenBank databases after database searches using the original 
15 partial cDNA sequences as the query. The predicted full length sequences for the 
cloned cDNA sequence for clones DMS39, DMS126, DMS218 and DMS253 are 
provided in SEQ ID NO.728-731, respectively. SEQ ID NO.728-731 were analyzed by 
comparison to sequences in the publicly available databases. A summary of these 
comparisons is presented in Table 16. 

20 

Table 16 



SEQ ID NO: 


Clone Name 


Biastn 


728 


DMS-39 


Human radixin 


729 


DMS-126 


Human nuclear mitotic apparatus 
protein 


730 


DMS-218 


Hu. cDNA: FLJ21840 fis; XPMC2 


731 


DMS-253 


Hu. mRNA for KIAA1582 protein 



Example 15 

Analysis of cDNA Expression using Micro array Technology 

25 In an additional study, a clone obtained in Example 12 was found to be 

overexpressed in specific tumor tissues as determined by microarray analysis. Using 
this approach, the cDNA sequence is PCR amplified and its mRNA expression profiles 
in tumor and normal tissues was examined using cDNA microarray technology as 
described in Example 13. Microarray analysis showed the cDNA for LSCC-86 is 

30 strongly overexpressed in small cell carcinoma cell line HTB 173; moderately 
overexpressed in atypical carcinoid METs, adenocarcinoma lung tumors and squamous 
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lung tumors; and slightly overexpressed in primary small cell. This cDNA is also 
strongly overexpressed in pituitary gland; moderately overexpressed in brain and 
adrenal gland; and slightly overexpressed in skeletal muscle. 

Clone LSCC-86 was originally isolated in Example 12 and a partial 
5 sequence of this insert is provided in SEQ ID NO:665. An extended sequence was 
obtained by PCR sequencing using internal primer sequences designed from the partial 
cDNA sequence of clone LSCC-86. This extended sequence represents the full-length 
sequence for the cloned cDNA sequence of clone LSCC-86. The determined full length 
sequence for LSCC-86 is provided in SEQ ID NO:732. SEQ ID NO:732 was analyzed 

10 by comparison to sequences in the publicly available databases. Database searches 
showed no homology in GenBank, seven ESTs (3 lung tumor and 4 uncatagorized hits) 
in the human EST database, and no homology in Blastx. Three open reading frames 
(ORFs) were identified. A first that encodes a protein with a sequence of 50 amino acid 
residues (SEQ ID NO:733) which is fused to LacZ. A second that encodes a protein 

15 with a sequence of 76 amino acids residues (SEQ ID NO:734) which shows no 
homology in the databases. A third that encodes a protein with a sequence of 74 amino 
acid residues (SEQ ID NO:735) which also shows no homology in the databases. 
However, a motif search of SEQ ID NO:735 shows a possible small cytokine signature. 

20 Example 16 

Quantitative Real-time PCR Analysis using cDNAs Identified by T-cell 
Expression Cloning 
In this Example, the nucleic acid sequence of the cDNA inserts 
contained in clones L86S-39 and L86S-47 (SEQ ID NOs:89 and 90), identified by T- 
25 cell {i.e., serological) expression cloning, were used in Real-time PCR mediated 
expression analysis of a variety of tissues, including lung tumor, normal lung and other 
normal tissue samples. 

Briefly, the first-strand cDNA was synthesized from 20ug of total RNA 
that had been treated with DNase I (Amplification Grade, Gibco BRL Life Technology, 
30 Gaitherburg, MD), using Superscript Reverse Transcriptase (RT) (Gibco BRL Life 
Technology, Gaitherburg, MD). Real-time PCR is performed with a GeneAmp™ 5700 
sequence detection system (PE Biosystems, Foster City, CA). The 5700 system uses 
S YBR™ green, a fluorescent dye that only intercalates into double stranded DNA, and a 
set of gene-specific forward and reverse primers. The increase in fluorescence is 
35 monitored throughout amplification process. The optimal concentration of primers was 
determined using a pool of cDNAs from lung tumors, according to procedures known to 
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those of ordinary skill in the art. The PCR reaction was performed in 25ul volumes that 
include 2.5ul of SYBR green buffer, 2ul of cDNA template and 2.5ul each of the 
forward and reverse primers for the gene of interest. The cDNAs used for RT reactions 
are diluted 1:10 for each gene of interest and 1:100 for the P-actin control. In order to 
5 quantitate the amount of specific cDNA (and hence initial mRNA) in the sample, a 
standard curve is generated for each run using the plasmid DNA containing the gene of 
interest. Standard curves are generated using the Ct values determined in the Real-time 
PCR, which are related to the initial cDNA concentration used in the assay. Standard 
dilutions ranging from 20-2x1 0 6 copies of the gene of interest are used for this purpose. 

10 In addition, a standard curve is generated for p-actin ranging from 200fg-2000fg. This 
enables standardization of the initial RNA content of a tissue sample to the amount of 
P-actin for comparison purposes. The mean copy number for each group of tissues 
tested is normalized to a constant amount of p-actin, allowing the evaluation of the 
overexpression levels seen with each of the genes. 

15 Real-time PCR analysis performed as described above demonstrated that 

mRNAs corresponding to the cDNA inserts contained in clones L86S-39 and L86S-47 
are overexpressed in 2 out of 6 squamous lung tumors and in 2 out of 4 head and neck 
tumors, with lower levels of expression detected in an additional 3 squamous and in 2 
head and neck tumors. Little or no expression was detected in normal lung tissue. 

20 Some expression was detected in soft palate, tonsil, trachea, esophagus, salivary gland, 
bronchus and cervix. 

Example 17 

Identification of an Extended Polynucleotide Sequence and Corresponding 
25 Amino Acid Sequence For cDNA Inserts Contained in Lung Tumor Clones 
L86S-39ANDL86S-47 
This Example describes the determined extended nucleic acid sequence 
for the full-length cDNA inserts contained in clones L86S-39 and L86S-47, 
corresponding to lung tumor antigens identified by T-cell (i.e., serological) expression 
30 cloning. 

A determined 5' polynucleotide sequence of the cDNA insert contained 
in clone L86S-47 has been disclosed in SEQ ID NO:90, which corresponds to the amino 
acid sequence set forth in SEQ ID NO:99. In this Example, an extended nucleic acid 
sequence of the full-length insert contained in clone L86S-47 has been determined, as 
35 disclosed in SEQ ID NO:736. The amino acid sequence of a corresponding open 
reading frame contained therein is provided in SEQ ID NO:738. 
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The determined 5' polynucleotide sequence of the cDNA insert 
contained in clone L86S-39 has been disclosed in SEQ ID NO:89, which corresponds to 
the amino acid sequence set froth in SEQ ID NO:98. In this Example, an extended 
nucleic acid sequence of the full-length insert contained in clone L86S-39 has been 
5 determined, as disclosed in SEQ ID NO:737. The deduced amino acid sequence of a 
corresponding open reading frame contained therein is provided in SEQ ID NO:739. 

In view of the lung tumor-associated expression profile of these 
sequences, the nucleic acid and/or amino acid sequences as described above can be 
used, for example, in a variety of diagnostic and/or therapeutic applications associated 
10 with lung cancer, illustrative examples of which are described hereinabove. 

Example 18 

PCR Cloning and Identification of an Extended cDNA Encoding a Full-Length 
L200T Polypeptide 

15 In this Example, an extended cDNA sequence (SEQ ID NO:740), related 

to the polynucleotide identified in Example 8 (SEQ ID NO:440), was identified using 
an approach which combined (i) anchored PCR mediated subcloning and nucleic acid 
sequence determination, with (ii) sequence analysis of an overlapping human genomic 
DNA (gDNA) clone. This analysis was used to assemble the extended nucleic acid 

20 sequence identified in SEQ ID NO:740, including an extended open reading frame 
(SEQ ID NO:741) encoding a full-length L200T polypeptide having an amino acid 
sequence disclosed in SEQ ID NO:742. 

In Example 8, SEQ ID NO:440 was shown to be related to a nucleic acid 
sequence encoding a truncated human cystine/glutamate transporter (GenSeq No. 

25 Z16609; herein referred to as SEQ ID NO:743). However, unlike the . published 
sequence of the human cystine/glutamate transporter, SEQ ID NO: 440 contains a 
unique 5' terminus of 181 nucleotides. As disclosed in Example 8, this novel 5' 
sequence results in the removal of the reported initiating methionine (start codon) along 
with an additional two amino acids of the published transporter protein sequence. 

30 Accordingly, the deduced translation product of SEQ ID NO: 740 represents a 
polypeptide (i.e., SEQ ID NO:742) encoded by a translation reading frame that is clearly 
distinct from that used to translate the cystine/glutamate transporter protein. 

However, although SEQ ID NO:440 contains an ATG that could 
function as a translation initiation codon, no 5' inframe (upstream) stop codon was 

35 identified therein. Therefore, in order to further evaluate this open reading frame, 
anchored PCR mediated cloning of a cDNA library was used to identify cDNAs 
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containing additional 5' extended nucleic acid sequence, encoding additional amino 
terminal amino acids of the polypeptide referred to as L200T. Anchored PCR cloning 
identified an additional 47 nucleotides 5' of the first nucleotide of SEQ ID NO:440. 
This extended 5' nucleotide sequence disclosed in SEQ ID NO: 744, is also contained in 
5 SEQ ID NO:740 (nucleotide positions 38-84). Accordingly, this analysis identified an 
ATG translation initiation codon that extends the deduced amino acid sequence of the 
open reading frame in SEQ ID NO:440 by an additional 16 amino acids, as set forth in 
SEQIDNO:745. 

Further analysis of L200T nucleic acid and deduced amino acid sequence 

10 proceeded as follows. The 5' extended sequence (SEQ ID NO:744), contained in SEQ 
ID NO:740, includes an ATG translation start codon that is further upstream and 
inframe with the ATG codon previously identified in SEQ ID NO:440. Using the 5' 
extended cDNA sequence contained in SEQ ID NO: 744 and sequence contained in SEQ 
ID NO:440, bioinformatic analysis of a corresponding human genomic DNA sequence 

15 was used to identify an inframe stop codon upstream of the 5' most ATG translation 
start codon, as shown in the composite 5' extended cDNA sequence set forth in SEQ ID 
NO:740. Accordingly, the inframe stop codon (TGA) in SEQ ID NO:740 is identified 
by the T residue positioned 42 nucleotides 5' of the A of the translation initiation codon 
(ATG) of the full-length open reading frame sequence encoding L200T (SEQ ID 

20 NO: 741), contained therein. The clone containing this overlapping genomic DNA 
(gDNA) sequence (nucleotides 1-37 of SEQ ID NO:740) providing an inframe stop 
codon is contained within human chromosome 4, as identified by Genbank Accession 
No. AC093903 (SEQ ID NO:746). In this manner, anchored PCR cloning coupled with 
examination of overlapping genomic DNA sequence was used to establish the extended 

25 5' polynucleotide sequence according to SEQ ID NO:740, and to identify a full-length 
open reading frame disclosed in SEQ ID NO: 741, which is predicted to encode a 
corresponding full-length amino acid sequence (SEQ ID NO:742), the full-length 
polypeptide referred to as L200T. Consistent with analysis of SEQ ID NO:440 (see 
Example 8), the full-length amino acid sequence of SEQ ID NO:742 contains the CD8 + 

30 T-cell epitope of SEQ ID NO:587, which is reactive to the lung tumor cell line LT-391- 
06. 

The extended nucleotide coding sequence contained in SEQ ID NO: 741 
includes nucleotide sequence determined by anchored PCR from a cDNA library, while 
the inframe stop codon disclosed in SEQ ID NO: 740 was identified by analysis of an 
35 overlapping gDNA sequence, as discussed above. Accordingly, the genomic nucleotide 
sequence (161,280 base pairs) of human chromosome 4 (GenBank Accession No. 
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AC093903), is disclosed in SEQ ID NO:746. In addition, a 17,672 nucleotide sequence 
derived from SEQ ID NO:746, which contains exon 1, intron 1 and exon 2 of L200T is 
provided in SEQ ID NO:747. The upstream inframe stop codon of SEQ ID NO:740, 
proximal to the 5' end of the anchored PCR cDNA nucleotide sequence, is located at 
5 nucleotide position 7,153 of SEQ ID NO:747. The "ATG" translation start codon 
(identified in SEQ ID NO: 741) is located at nucleotide position 7,195 of SEQ ID 
NO:747, and the anchored PCR cDNA sequence starts at nucleotide position 7,190. 
Therefore, L200T exon 1 encompasses nucleotides 7,153 through 7,426, intron 1 
corresponds to nucleotides 7,427 through 17,403 and exon 2 begins at nucleotide 

10 17,404. The L200T CD8 + epitope (SEQ ID NO:587), as identified in Example 8, is 
encoded by nucleotide sequences 7,360 through 7,386, of SEQ ID NO:747. 

Also identified in this analysis is a difference between the gDNA 
sequence and the anchored PCR cDNA sequence. This difference is identified as an 
"A" residue in the gDNA nucleotide sequence at position 7,198 of SEQ ID NO:747 and 

15 as a "G" residue at nucleotide position 4 of the extended open reading frame sequence 
(SEQ ID NO:741). This nucleotide sequence difference results in the deduced L200T 
amino acid of residue 2 being a Ser when referenced to gDNA sequence and Gly when 
referenced to the extended nucleotide sequence of SEQ ID NO:740. In addition, this 
analysis further identifies that the nucleic acid sequence encoding the human 

20 cysteine/glutamate transporter (GenSeq No. Z 16609) starts at nucleotide position 
17,134 of SEQ ID NO:747, which is located within intron 1 sequence of the L200T 
gene. Thus, the above analysis supports, as discussed in Example 8, that SEQ ID 
NO:740 and GenSeq No. Z16609 (SEQ ID NO:743) share overlapping nucleic acid 
sequence, however, encode distinctly different polypeptides, an L200T polypeptide and 

25 the cyctine/glutamate transporter, respectively. 

Thus L200T nucleic acid and/or amino acid sequence, as described 
above, can be used in a variety of polynucleotide or polypeptide based diagnostic and/or 
therapeutic applications associated with lung cancer. 

30 Example 19 

Expression of Recombinant Truncated L200T in Prokaryotic Host Cells 

In this example, an open reading frame contained in SEQ ID NO:440 
identified in Example 8, which encodes an L200T polypeptide, was subcloned and 
recombinant protein expressed in E. coli host cells. 
35 Briefly, an open reading frame of SEQ ID NO:440 was PCR amplified 

using the forward oligonucleotide primer 
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5 ' G AGGTTG AAGTG AGC AGAGATC ATGCC 3' (SEQ ID NO:748) and reverse 
oligonucleotide primer 5' CTTACGAATTCATCAGCT-GCACTTTCTCCTGC 3' 
(SEQ ID NO:749). The PCR amplification using Pfu DNA polymerase (Stratagene, La 
Jolla, CA) was performed as follows: one cycle at 96°C for 2 minutes; 40 cycles of 96°C 
5 for 20 seconds, 62°C for 15 seconds, 72°C for 45 seconds; one cycle at 72°C for 4 
minutes. The PCR product was digested with restriction endonuclease EcoRI and 
cloned (ligated) into the expression vector pPDM Trx2H (a modified pET28 vector that 
includes an inframe thioredoxin (Trx) encoding sequence flanked, both 5' and 3', with 
6xHis tags) that had been digested with StuI and EcoRI. The ligation reaction was 

10 transformed into BLR (DE3) pLysS and HMS 174 pLysS competent bacteria (Novagen 
Inc., Madison, WS). Recombinant clones were identified, plasmid DNA prepared and 
the nucleotide sequence of the insert determined. Expression of the encoded 
recombinant 6xHis-Trx-6xHis-L200T polypeptide fusion protein of the expected size 
was confirmed in coomassie stained gels. In addition, the fusion protein described 

15 above also includes several protease cleavage sites (enterokinase, thrombin and XA) 
located between the second 6xHis sequence and L200T coding sequence that are useful 
in the analysis, purification and preparation of recombinant L200T polypeptide. 

The PCR mediated subcloning of truncated L200T into the above- 
identified modified pET28 vector directs the expression of recombinant truncated 

20 L200T containing an amino terminal fusion with 6xhistidine epitope tags and 
thioredoxin (Trx). Accordingly, recombinant protein expressed in E. coli host cells may 
be detected with commercially available anti-thioredoxin antibody and/or anti- 
6xHistidine antibody. Addition of thioredoxin as part of the amino terminal fusion 
protein allowed for the expression of detectable levels of recombinant truncated L200T 

25 fusion protein in E. coli host cells. 

Recombinant L200T produced in this manner can be used, for example, to prepare 
monoclonal and/or polyclonal antibodies, to detect antibodies in patient sera, and/or in a 
variety of other diagnostic and/or therapeutic applications associated with lung caner. 

30 From the foregoing it will be appreciated that, although specific 

embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

What is Claimed: 

1. An isolated polynucleotide comprising a sequence selected from the 
group consisting of: 

(a) sequences provided in SEQ ID NOs: 740, 588, 598, 603, 611, 616, 
622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 665, 675, 681, 686, 703, 720, 
723, 732, 736, 737, 741, 744 and 746; 

(b) complements of the sequences provided in SEQ ID NOs: 740, 588, 
598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 665, 675, 
681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746; 

(c) sequences consisting of at least 20 contiguous residues of a sequence 
provided in SEQ ID NOs: 740, 588, 598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649- 
651, 655, 656, 659, 660, 664, 665, 675, 681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 
746; 

(d) sequences that hybridize to a sequence provided in SEQ ID NOs: 740, 
588, 598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 665, 
675, 681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746, under moderately stringent 
conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID NOs: 
740, 588, 598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 
665, 675, 681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746; 

(f) sequences having at least 90% identity to a sequence of SEQ ID NOs: 
740, 588, 598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 
665, 675, 681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746; and 

(g) degenerate variants of a sequence provided in SEQ ID NOs: 740, 588, 
598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 660, 664, 665, 675, 
681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746. 

2. An isolated polypeptide comprising an amino acid sequence selected 
from the group consisting of: 

(a) sequences encoded by a polynucleotide of claim 1; and 

(b) sequences having at least 70% identity to a sequence encoded by a 
polynucleotide of claim 1; and 

(c) sequences having at least 90% identity to a sequence encoded by a 
polynucleotide of claim 1; and 



136 



WO 02/092001 



PCT7US02/14975 



(d) at least a portion of an amino acid sequence set forth in SEQ ID NOs: 
742, 733, 734, 735, 745, 738 and 739. 

3. An expression vector comprising a polynucleotide of claim 1 operably 
linked to an expression control sequence. 

4. A host cell transformed or transfected with an expression vector 
according to claim 3. 

5. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 2. 

6. A method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that binds to a 
polypeptide of claim 2; 

(c) detecting in the sample an amount of polypeptide that binds to the 
binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off value 
and therefrom determining the presence of a cancer in the patient. 

7. A fusion protein comprising at least one polypeptide according to 

claim 2. 

8. An oligonucleotide that hybridizes to a sequence recited in SEQ ID 
NOs: 740, 588, 598, 603, 611, 616, 622, 625, 629, 636, 639, 641, 649-651, 655, 656, 659, 
660, 664, 665, 675, 681, 686, 703, 720, 723, 732, 736, 737, 741, 744 and 746 under 
moderately stringent conditions. 

9. A method for stimulating and/or expanding T cells specific for a tumor 
protein, comprising contacting T cells with at least one component selected from the group 
consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1; and 
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(c) antigen-presenting cells that express a polypeptide according to claim 

2, 

under conditions and for a time sufficient to permit the stimulation and/or 
expansion of T cells. 

10. An isolated T cell population, comprising T cells prepared according to 
the method of claim 9. 

11. A composition comprising a first component selected from the group 
consisting of physiologically acceptable carriers and immunostimulants, and a second 
component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1; 

(c) polynucleotides having a sequence as provided in any one of SEQ ID 
NOs: 740, 589-597, 599-602, 604-610, 612-615, 617-621, 623, 624, 626-628, 630-635, 637, 
638, 640, 642-648, 652-654, 657, 658, 661-663, 666-674, 676-680, 682-685, 687-702, 704- 
719, 721, 722, 724-731, 736, 737, 741, 744 and 746; 

(d) antibodies according to claim 5; 

(e) fusion proteins according to claim 7; 

(f) T cell populations according to claim 10; and 

(g) antigen presenting cells that express a polypeptide according to 

claim 2. 



12. A method for stimulating an immune response in a patient, comprising 
administering to the patient a composition of claim 11. 

13. A method for the treatment of a cancer in a patient, comprising 
administering to the patient a composition of claim 11. 

14. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide according to 

claim 8; 

(c) detecting in the sample an amount of a polynucleotide that hybridizes 
to the oligonucleotide; and 
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(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the presence of 
the cancer in the patient. 

15. A diagnostic kit comprising at least one oligonucleotide according to 

claim 8. 



16. A diagnostic kit comprising at least one antibody according to claim 5 
and a detection reagent, wherein the detection reagent comprises a reporter group. 

17. A method for inhibiting the development of a cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient with at 
least one component selected from the group consisting of: (i) polypeptides according to 
claim 2; (ii) polynucleotides according to claim 1; and (hi) antigen presenting cells that 
express a polypeptide of claim 2, such that T cell proliferate; 

(b) administering to the patient an effective amount of the proliferated T 

cells, 

and thereby inhibiting the development of a cancer in the patient. 



139 



WO 02/092001 



1 



PCT7US02/14975 



SEQUENCE LISTING 



<110> Corixa Corporation 
Lodes, Michael J. 
Wang, Tongtong 
Fan, Liqun 
Algate, Paul A. 
McNeill, Patricia D. 



<120> COMPOSITIONS AND METHOD^ FOR 

THE THERAPY AND DIAGNOSIS OF LUNG CANCER 



<130> 210121. 47502PC 

<140> PCT 

<141> 2002-05-10 

<160> 749 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 339 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gtactcagac aggatagtca tcatgtagca caaagcamat cctgtttcta tacttgtagt 60 
ttgctctcac tcagtggcat ratcattact atacagtgta gaatgttrtt atgtagcata 120 
gatgtggggt ctctagccca cagctctsta cctttgtcta gcactcctgt cctcatacct 180 
ragtggcctg tccatcagca tgtttctcat ctactttgct tgtccagtcc actgtggtcc 240 
tcccttgccc tctcccttat gtggcagagt ggaaccagct gtcctgagac ttgagttcaa 300 
catctggttc gcccatytgc atgtttgtgg tctgagtac 339 

<210> 2 

<211> 698 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 79, 459, 463, 466, 469, 585, 642, 645, 656, 660, 666, 670, 
671, 678, 686, 693 
<223> n = A,T,C or G 

<400> 2 

gtactcagac cacgactgca ttttctccac tgctgacggg tctaatacca gctgcttccc 60 
tttcttggag gcagagctng tgaccttgag aaagtgacct gtgaccatca tgtgggtagt 120 
gagctgctgc aaggtgtcat gggagctccc acactccatg cactttwaga tctgggactt 180 
gcaggcctca ractgccagg tgtagctcgc tccattttgg tagccatagc gsttgttgga 240 
ggacaactgc aagttggcgt tcttctgaga agaaaaagaa tctgcaaaag atcctgtggt 300 
tgaatcgggg gaacacggcc gattgacatc aaaaacgcgt ttcttagccc gggtgaccat 360 
tttcgaggaa atggttgggg actggctcct tcaaaggcac tttttggtta tgttttgttt 420 
yaatcatgtk gacgctccaa tcttggragg gaatcgaang rantcnccnc caaaacatrc 480 
stttcagraa ccttttgarc atcctctttt ttccgtrtcc cggmaargcc cytttccckg 540 
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ggctttgaaa wyagcctsgt tgggttctta aattaccart ccacnwgttg gaattccccg 600 
ggccccctgc ccggktccaa ccaattttgg graaaacccc cncansccgt tkggantgcn 660 
acaacntggn ntttttcntt tcgtgntccc ctngaacc 698 

<210> 3 

<211> 697 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 65, 83, 137, 140, 190, 231, 250, 280, 307, 372, 376, 453, 
467, 468, 474, 507, 515, 519, 520, 521, 527, 540, 541, 552, 
567, 572, 575, 576, 577, 582, 585, 607, 608, 610, 616, 617, 
624, 631, 636, 640, 646, 658, 669, 690 
<223> n = A,T,C or G 

<400> 3 

gtactcagac ccccaacctc 
agccngccag gccattgaag 
gggccaagta gctgcantan 
actatgtgtn aggggtacag 
cttttgcctn aacttggggt 
atggaanaaa atccgtattt 
tacagtccaa angatnttcc 
caaaatcttt tatttttttc 
ttcccatctc ccccattcca 
nccgaaaggg tntttttatt 
gcggccnngn gggttnnacc 
ccttgaaant tttcttttgg 

<210> 4 

<211> 712 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 32, 59, 102, 117, 220, 238, 246, 431, 435, 487, 578, 579, 
594, 603, 610 
<223> n = A,T,C or G 

<400> 4 

gtactcagac aaccaatagg tgtgttyctc anatctgaaa cacaaaaaga ttctagctna 60 
taatgttsaa tggttgaggg tttaagtgat cttggtatgt tngatttagc agcgatnggc 120 
cgggtgcggt ggctcacgca tgtatcccag cactttggga ggccgaggca ggaggatcac 180 
ctgaggtcag gagtttgaga ccagcctggc cgacatggtn aaaccccgtc tctactanga 240 
atacanaaat tagcccgggc atagtggcgc gtgcctrtga cctcsgctac tttggggatt 300 
ctcctgagga agaattgctt gaactcaggg aagtggargt ttgcagtgag cttaaatcaa 360 
gccactggca ctcccagcct gggktaacag agccamgact ctkgccgaaa aaaaaraama 420 
cgacggagaa nmagntctgt tattccatgg gaaattkgaa tttccttcyt tkaaatatct 480 
taaaatnggt cctcctwaaa aaagttcggc tggggcccgk tggctcacat tttkttaycc 540 
cycccccttt tggggarggc caarggccgg kttgawtnnc ccttgagggg ccanaactcc 600 
agnaaccrgn cccgggccar smgwkgkstr armccctttc cyyccmaraa aawwcsmaaa 660 
wwttycccsc cygsykggct ggkasckgtt myyyyygmtm csyagcttgc tt 712 

<210> 5 
<211> 679 



gaacagccag aagacaggtt 
ganaagcaaa gacgaagcga 
ccttcagtcc cagttgcatt 
aagcttttcc tcatagggca 
ttctgtggtt cataaagttn 
ggcaaaaaga ctccaggggg 
ccaaagaata gacatttttt 
ctttctcgca ccnccccaga 
cacgatnttg aattngcann 
cggggtnctg anttnnnaac 
atgntgggga naactncccn 
tcgccccccn gagattc 



gtctcctggg ccttggacac 60 
accatctctc tccattgtgg 120 
gggttaaaga gctcatacat 180 
tgagctctcc nagagttgac 240 
ggatatgtat tttttttcaa 300 
atgatactgt ccttgccact 360 
cctctcatca cttctggatg 420 
ccccttnnag gttnaaccgc 480 
ncgttgntgg tcgggtcccn 540 
cnctnagttg aatccgcggg 600 
ccgcgnttgg aatgccanag 660 
697 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 28, 185, 200, 312, 315, 324, 337, 396, 409, 410, 425, 439, 
465, 481, 483, 485, 488, 501, 519, 523, 530, 547, 555, 580, 
583, 586, 602, 603, 625, 635, 644, 646, 649, 655, 658, 668 
<223> n = A,T,C or G 



<400> 5 

gtactcagac cacctcacat gcagggtnag 
acatcccttt gtgagcacgg ctgctccgga 
cagagggttc tgcaggatgt gctattttaa 
atctngaagc agaacatgtn atcagcgatg 
tatttgaggc tctaccaggc ctgtctacag 
ctgttatttt anatnccaca tatntttttt 
atcgtaacta ctagtgactt ctgaggttta 
agttntgtta ttgatgatng gtaatctaca 
ncntncanct gaccagtttg nagggcactc 
tgtttcnagg gggcntgggg ggtttaaaaa 
cnnctattga aaaaatgggg gtcgnggggg 
tccggaanct tggtttccc 



aaacatggag tgtgcggcag catcctcctc 60 
atactgacca tctgggctag cacgacctaa 120 
agcagctggg tgcaacttgt gaaaacggga 180 
gctgggattg gtggacagga ttgacaggag 240 
gacagcttca tcgaagggac attttttaac 300 
aatgctnaag catacaggtt gaatttctgg 360 
cagttngaat atgttctcnn aggtttatca 420 
cctctggaag ctgtngaatg tgaaaaagat 480 
tcttctggna agnaatccgn ccaaaaaaat 540 
aatgtttctn ttnccntaaa aatgtttacc 600 
gcttnaaatc cccnanttnt gaatnttnta 660 
679 



<210> 6 

<211> 369 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 156, 311, 325, 345, 357 

<223> n = A,T,C or G 



<400> 6 

tcagtccagt catgggtcct ataagagaag 
aagcttcatt tctttaccct gcagcaacag 
aattcattaa ctttgtggtt gaagggagca 
gaaaacaaac agattcatct ccggaaacca 
agctgtatcc tagatggtca atttccagtg 
cttcctacct nggcctgatc agctnggcac 
ctggactga 



tcactctgtg agtttccatg gaggaagaaa 60 
cggagggagg gagagcctat cttctttgca 120 
gcgtcngaaa ctgctttagc acagtgggag 180 
aaggaaaggg tragtgggtt tttattagcc 240 
gatgaataca ccttacgtac gtttctcttg 300 
ttraatcatt ccgtnggggt wgctgtnaca 360 
369 



<210> 7 

<211> 264 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 57, 176 

<223> n = A,T,C or G 



<400> 7 

tgctggatra gggatggggc acgggagcac 
aggaaaggat tacaggcgtg agccactgcg 
agtctctggt tcttctttct cagtttgttg 
tgaacactac actcggaatc aggaagccct 



agatmgactt taactgcccc cacgttntcm 60 
cccggcctct tctccacttt cataggttcc 120 
tttttgcttc ttaaatmatg gagatnagaa 180 
gcctggcgcc tctgtcacct gtctaggggc 240 
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ttcttctcac tgagtcatcc agca 264 

<210> 8 
<211> 280 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 15, 20, 81, 96, 112, 129, 214, 215, 247 
<223> n = A,T,C or G 

<400> 8 

acctcaactg cccanaacan aactgttgta caagatttga ggatttaaca atatttcaca 60 
tgaaatattt cagacctacg ngagggctta aagacnaatt aaatgagcac cngtgtgccc 120 
accgccccna ttaagaatta gagcaagcag tgaggtgaag ccttgtcctt gcttttaaca 180 
tagaaagtga tccaaattca ccaaacttga cttnnggttt tgcagtgtgg cctcctgatt 240 
ctagacnctg gcgaaacatt tgatgggcaa aaaaaaaaaa 280 

<210> 9 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 61, 71, 90, 264, 273, 377 
<223> n = A,T,C or G 

<400> 9 

tcgtcaactc caggatggct ttgaaaatna atggacacag atctctcctg ttttgatrat 60 
ntgcagtgct natgactggc tttgcagttn attttgattc aggcaacaga tgttcctttt 120 
ggttccctgt ctcccatggg cgtcatttca tgttgtcctc tgccttcccc cagatattct 180 
aagttcagga cacaagcttc tggcccatgc agagcagagg ccatgagggg tcacagcatg 240 
ggtacgggag gaaacactgg gctnacccag atnctggact tgagtcttgc ctctgctgct 300 
tgctgcacag cttctgtcat ggtgctaaac ctgtgacctg cctcacaggc ttagagcatg 360 
cccgtagaag tactctnaac taaratgctt tccacaaatg agatggtttc atgaaaactt 420 
caaatagagg gcctgggcaa aaaaaaaaa 449 

<210> 10 
<211> 538 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 173, 207, 218, 225, 252, 281, 282, 290, 385, 395, 433, 454, 
462, 530 

<223> n = A,T,C or G 
<400> 10 

tttttttttt ttcccaaagg cctcaraaca ctagtcttct aattccaagc agaaagttac 60 
atccgccggg atacatgcca cttggtttga taaatcaaaa tacagcatcc ttcagatccc 120 
tttgctgagc aatacaatta tttgtatatg ttactttttt ttctgtttgg ctnaaagatt 180 
tgatatgagc tgaggaaaat gaagccntta ctgctatnag atctnatccc tttccaccac 240 
ctttcaggga tnttggcact gcayatattc agaattcccc nnagtcgctn gtgataaaaa 300 
tgtcttcaga gatggcagaa tatgtttcct ttggtacatg ttcattaaaa atatacacgt 360 
gctcactact gtggatatgt atgtnttgac cgatnacaca ggctgattta gggaagagat 420 
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aaaagcacac ttngaattta ttagcctttc accnagacta anattctgaa attaagaatg 480 
tattccttgg tcaacaattt tcctcttctc ttagccctct tacattgtan tggactga 538 



<210> 11 

<211> 543 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 147, 171, 190, 206, 216, 235, 281, 285, 286, 289, 331, 361, 
364, 385, 392, 393, 409, 418, 429, 448, 449, 514, 535 
<223> n = A,T,C or G 



<400> 11 

tttttttttt ttgcccacag ctgccatctt tgtgtgataa ggccaacctt ctatgggaat 60 
caaccctcgc catcccagca aatcccctct ctcccttctc atgggagtgc cttgtattca 120 
tcaggcatct gggacttgat gtgggtntgg gatttgaaat cagagcacct nggtctctst 180 
caccattctn tcacttatta gctctnacct tgggtnaata cctgccttag tgtcntaggt 240 
acaatatgaa tattgtctat ttctcaggga ttgcaatgac nagtnnatna gtgcatgaga 300 
gggtaaaacc acagggtact ccgctcctcc naagaatgga gaattttttc tagaagccca 360 
natntgcttg gaaggttggc caccnagagc cnnaatcttc ttttatttnc cactgaangc 420 
ctaagaggna attctgaact catccccnna tgacctctcc cgaatmagaa tatctctggc 480 
acttaccata ttttcttgcc ctcttccact tacnaaactc ctttattcct taacnggacg 540 
aaa 543 



<210> 12 

<211> 329 

<212> DNA 

<213> Homo sapiens 



<400> 12 






cgatgacttg 


ggcagtgagt 


gggcctcctg 


ccttggcctc 


ccccctctgc 


agstacctct 


ggcaagacca 


taggtggggt 


gctgggaatc 


ctcaagggag 


agacccactt 


gttcagatgc 


gagccacaga 


gtcaaataaa 


aatcaatttt 


tgatgttcaa 


ggcaagttgc 


aagtcatcg 


<210> 13 






<211> 314 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 238, ~ 


244, 267, 273 


<223> n = A,T,C or G 





ccaggtggca gggcacagct tagaccaaac 60 
gaccaagaag gaaactagca agcctatgct 120 
ctcggggccg gctggcaccc actcctggtg 180 
atrggcctca ggcggttcaa ggcrgtctta 240 
gagagaccac agcacctgct gctttgatcg 300 
329 



<400> 13 

cgatgacttg cacccgggag ctgtgacagt 
gcgggagtgg agaccaccaa accctccaaa 
tacctgagcc tgacgcccga gcagtggaag 
catgaaggga gcaccgtgga gaagacagtg 
tctnacccca cccacgggag cctgganctg 
atcccaagtc atcg 



ggcctggaag cagatggcag ccccgtcaag 60 
cagagcaaca actagtacgc ggccagcagc 120 
tcccacagaa gctacagctg ccaggtcacg 180 
gcccctacag aatgttcata ggttcccnac 240 
cangatcccg ggggaagggt ctctctcccc 300 
314 



<210> 14 
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<211> 691 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 20, 166, 186, 196, 361, 54: 

<223> n = A,T,C or G 

<400> 14 

cgattacttg cacaatgcan attagaaccc 
ttccagttca gtgctgctgg gtttttctta 
gaagagaaga ataaagtcta ttttggtctt 
tctacnagaa aacccnaagt gaacccggct 
ggcattacct ttcaacacca gaggttcttt 
gtgaagtggc tgggarggca tcactcggct 
natttctttg gaaaaagatc ctcttgaaag 
ctaggacagt gatcttgccc ctgcttgcas 
tkatgtgsam cgctccttgg atrtkactct 
ccnwtnaatg amssgggccc ttaactccgg 
ycytcttcyt ttgsccmggt tcktcnaaac 
nggtycaacc wttttgggaa mamcyccccc 



546, 577, 581, 626, 636, 661 



aaatgaaggg tacaacccag atcttctggc 60 
ctaaaccaaa acaatkaaga gcatagaagg 120 
tggtagcchg ggtaangaga atgctstcac 180 
aatcaggacc gtgcttggga agggagcagg 240 
gccttctctc tgcagggact cgargactat 300 
tggttcattg gtrttctcat cataaactat 360 
artccttgcc ttccctacag gaaatcaagt 420 
tctccgccgg ctgatcttat csgscccagt 480 
tgttttwctc cvaggaaggg gcytgcmagt 540 
scrggtnamy ncttgsctsc rattttgggt 600 
cacttngttr aattccccgg sccgcctkgc 660 
c 691 



<210> 15 

<211> 355 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 87, 146, 195, 333 
<223> n = A,T,C or G 



<400> 15 

acctgaactg tgtgttgaag 
accgtgccta tgtccgacag 
tcyaccctcg gctcttacct 
cagttcgagc ctctnaagag 
tcaacctctt cctctgggtg 
gcgtctcctc cttcagtcag 

<210> 16 

<211> 522 

<212> DNA 

<213> Homo sapiens 



agtgatgtcc tgctgcctgg 
ctagttncct ccatggatgt 
ttgacnaagt ctcccgttga 
cgtctaagcg atggggatat 
ggagcaagcg tccagcaggg 
atcaccagtg gtntgagtgt 



agctcaagtc actactgatg 60 
gactgagacc aatgtcttct 120 
gagtactacc gaaccaccag 180 
atatttactg gagaatgggc 240 
tgttgtccag agccttttca 300 
tctgccagtt caggt 355 



<220> 

<221> misc_feature 

<222> 90, 205, 213, 240, 264, 381, 405, 410, 414, 417, 423, 429, 

432, 480, 508 

<223> n = A,T,C or G 

<400> 16 

tcagtccagt gaggtggaag acttcgaggc tcgtgggagc cgcttctcca agtctgctga 60 
tgagagacag cgcatgctgg tgcagcgtan ggacgaactc ctccagcaag ctcgcagacg 120 
tttcttgaac aaaagttctg aagatgatgc ggcctcagag agcttcctcc cctcggaagg 180 
tgcgtcctct gaccccgtga ccctncgtcg aangatgctg gctgccgccg cggaacggan 240 
gcttcagaag cagcagacct cctngcgctc ccttgccttc ctcagctgcc tcctgcgccc 300 
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tgtgcccggc tgactggagg aggcctgtcc 

tgactgcatt gccgctgtat naaagcatgt 

ttnatcctnc tntgtaatac ttcctatgtg 

attttaactg tgagtttgat ctcttctngt 

<210> 17 

<211> 317 

<212> DNA 

<213> Homo sapiens 



aattctgccc gccccatgga aaagcgggct 360 
ggtcttacag tgttnggacn gctnatnaat 420 
acatttctct tccccttgga aacactgcan 480 
gttactggac tg 522 



<400> 17 

gtgtcgcgaa ttcgcggtgg tgctaagaaa 

aaggataagc accagagaaa gaaggttcag 

gagaatggca aaattagttg ccttcgtcga 

gtgtttatgg caagtcactt tgacagacat 

ttcaactaac cagaagacaa gtaactgtat 
aaaaaaaaaa actcgag 



aggaagaaga agtcttacac cactcccaag 60 
ccggccgtcc tgaaatatta taaggtggat 120 
gagtgcccct ctgatgaatg tggtgctggg 180 
tattgtggca aatgttgtct gacccactgt 240 
gagttaatta aagacatgaa ctaaaaaaaa 300 
317 



<210> 18 

<211> 392 

<212> DNA 

<213> Homo sapiens 



<400> 18 

tggagatttc taatgaggtg aggaagttcc 

aggaacatgt taaaaatcct tacaaaggca 

agaagcccct gaccccttat ttccgcttct 

tccaccctca gatgagcaac ctggacctga 

ttccggagaa gaagaagatg aaatatgttc 

gagcgaaacc tggcccgatt cagggaggat 

cggacatccc agagaagccc caagaccccc 

<210> 19 

<211> 2624 

<212> DNA 

<213> Homo sapiens 



gtacattgac agaattgatc ctcgatgctc 60 
aaaaactcaa gaaacaccca gacttcccca 120 
tcatggagaa gcgggccaag tatgcgaaac 180 
ccaagattct gtccaagaaa tacaaggagc 240 
cggacttcca gagaagagaa acaggagttc 300 
cacccccacc ttatccagaa tgccaagaat 360 
eg 392 



<400> 19 

gaaacagtga gaaggagatt cctgtgctca 
acattegcat aaaccctcag tcctggtttg 
tcttgggctg cccactgccg gatcctaata 
ecaeggatga cctggatttt aagcaccaca 
ttgtcaatga aatgtgcccc aatattacca 
gectgaaatt gtatgcggta gagatctctg 
ccgagttcca ctacatcgca ggggcccacg 
tgetgetget gcacttcctc tgccaggaat 
tggtggagga gactcgaatc cacattctac 
cctatgaagg aggttccgag ttgggaggct 
tcgatatcaa caacaacttt ccggatttaa 
agaatgcccc aaggaaggtc cccaaccact 
agaatgecac agtggccaca gagaccagag 
ttgtgctggg aggcaaccta caggggggtg 
tgcggtccct gtggaagacc caggagcaca 
ggctggcgta ttcctacgcc tccactcacc 
gccacacgga agattttcag aaggaggagg 
tggctggaag tctaaacgat ttcagctacc 
acgtgggctg tgataaatac ccacacgaga 
gggagtctct gattgtgttc atggagcagg 



atgagctgee agtccccatg gtggcccgct 60 
ataaegggag catctgeatg aggatggaga 120 
actattatca ccgacgtaat gagatgacca 180 
actattagga aatgcgccag ttgatgaagg 240 
ggatttacaa cattggcaaa agccaccagg 300 
accatcctgg ggaacatgaa gttggtgagc 360 
gcaatgaggt tctgggacga gaactgctgc 420 
actcggcgca gaacgcacgc atcgtccgct 480 
cctccctcaa tcctgatggc tatgagaagg 540 
ggtccctggg acgttggacc catgatggca 600 
actcgctgct ctgggaggca gaggaccagc 660 
acattgecat ccctgagtgg tttctgtctg 720 
ccgtcatcgc ctggatggag aagatcccgt 780 
agctggtcgt ggcatacccc tatgacatgg 840 
ccccaacacc tgatgatcat gtgttccgct 900 
gcctcatgac agatgecagg aggcgagtgt 960 
gcaccgtcaa tggggcttcc tggcacacag 1020 
tccatacaaa ctgetttgag ctgtccatct 1080 
gcgagctgcc ggaggaatgg gagaataacc 1140 
ttcatcgagg catcaaaggc atagtgagag 1200 
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atttacaagg gaaagggatt tcaaatgctg 
tccggacagc cagcgatggg gattactggc 
cagccaaggc ggaaggcttt atcacttcca 
gagctactcg gtgtgacttc accctcacaa 
tggagacatt tgggaagcag cctgtcagcc 
ggaaaaggcg gcagcgtggg tgaccctgtc 
aataaaaatc cactccagta gtaactctgt 
ttcaagagac actcaggagc atacctgcat 
tggctcaggg tgttttgttt tttgtttttt 
accttgaaca gagcagcaga gaaaggccgg 
gtctgagatt ctaaaaaggg tgcttgacca 
catttgcgtg acattcaagc ttcccagtgc 
acccagggaa tcctttgccc cagatgttat 
aatccatcct ctctggcccc aggggacaag 
attgacaata gaggcattta ttaccaagtg 
cctttttcca acaaagcttg tcttcctaag 
gagtgctggg cagcagaccc tgggggaggg 
tttgctgctc cattagccct ggggtgagga 
cagtcagggc atcttggaaa agaccttgaa 
gaatgtgaga gctccaagtt ggccccaatc 
gccacatctt gccaatcaag catcatctga 
ggacacgtca gtctgggaga ggtggttgaa 
tgtgttgatc ctgctgcctt gttgacctgg 
ataaagcaaa tggtaagatt aaaaaaaaaa 

<210> 20 

<211> 488 

<212> DNA 

<213> Homo sapiens 



tcatctctgt ggaaggtgtt aaccatgaca 1260 
gtctactgaa ccctggcgaa tatgtggtca 1320 
ccaagaactg catggttggc tatgatatgg 1380 
agaccaacct ggctaggata agagaaatta 1440 
taccctccag gcgcctgaag ctgcggggac 1500 
ggacacttga gacatacccc agaccgtgca 1560 
agcaggcttt ccctgttgtt ttgactgtaa 1620 
ggcttggctg accccaaagg ggagggctgg 1680 
gttttttcct ttgttctcat ttatccaaat 1740 
tggcagtgag ggaattaatt cagtgagtca 1800 
ctggccagga agggaaatca ggccttcccc 1860 
atttgcaagt ggcacagttg acattgcagc 1920 
catttgagat gctcttatgc agcctaagaa 1980 
ccaagctgct atgtacacac tcggtgttct 2040 
tgcatcgctg agtcctaaat cagctctgtt 2100 
agcagacaga agtggagagc acccaagaat 2160 
ggcttgctat cccagaaagc ccctaaaccc 2220 
gagccagaca tgttaggagg ccagagcagt 2280 
ggaagcaaac cctgggttcc ttttgctcca 2340 
aggaggggag taatgatgaa catacagacg 2400 
tgaaaaagaa agcaatctta ggattacctg 2460 
tcattgtgta agggaatagt gtatctaatc 2520 
agagaatgaa acaaacaaac acataaacaa 2580 
aaaaaaaact cgag 2624 



<400> 20 

ctttcaaccc gcgctcgccg gctccagccc 
gctccgcagg gtgaggtggc tttgaccccg 
gctggacagc tggaggatga acggagaagc 
ccccagaggc caagaccgtt ggtcccagga 
gaacaacctt ccatccaatg acagctccca 
ggaaaaagtt gcattgaaag acttttctgg 
ctctaatgag gtgaggaagt tccgtacatt 
tgtttaaaat ccttacaaag gcaaaaaatc 
cctaaccc 



cgcgcgcccc caccccttgc cctcccggcg 60 

ggttgcccgg ccagcacgac cgaggaggtg 120 

cgactgcccc acagacctgg aaatggccgc 180 

agacatgctg actttgctgg aatgcatgaa 240 

gttcaaaacc acccaaacac acatggaccg 300 

agacatgtgc aagctcaaat gggtcgagat 360 

gacagaattg atcctcgata ctcaggaaca 420 

aagaaacacc ccgacttccc cgagaaagcc 480 
488 



<210> 21 

<211> 391 

<212> DNA 

<213> Homo sapiens 

<400> 21 

atggaattgt ggttttctct ttgggatcaa 
tggcgattgc tgatgctttg ggcaaaatcc 
cccgaccatc gaatcttgcg aacaacacga 
tgcttggtca cccaatgacc cgtgccttta 
aaagcatatg caatggcgtt cccatggtga 
atgcaaagcg cagggagact aagggagctg 
ctgaagatct agaagatgct ctgaagagca 



tggtctcaga aattccagag aagaaagctg 60 
ctcagacagt cctgtggcgg tacactggaa 120 
tacttgttca gtggctaccc caaaacgatc 180 
tcacccatgc tagttcccat ggtgttaatg 240 
tgataccctt atttggtgat cagatggaca 300 
gagtgaccct gaatgttctg gagatgactt 360 
g 391 



<210> 22 
<211> 1320 
<212> DNA 
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<213> Homo sapiens 



<400> 22 

aatctgctgg gaatttcttg ggttgacagc 

gtcctggatt acttttcaga aagaagtaat 

gtggtcaaaa tgcagaggct aacattagaa 

atccttttgc atgctcaaga gcccattctt 

cctgcccaag ttatcccact agctgattac 

ccagacttgg gatcagttat aaactctaga 

gcttttgatg aagctatgtc atactgtcga 

ttcaaagatc atgaagagca agataaagtc 

agctctattt ttcagagaca acgtgtggat 

ccacccaaat ttgtgcagct aaagcctgga 

aaagaggcag aacctatacc agaaactgta 

gtacaacaga cagtgagtgc taaaggcccc 

ctggacaaaa gagaagcctg gaagactcct 

ctcttgagct ttgaagtact ttattgtaac 

ttgaaaggat attaggccgg atgtggtggc 

ccatggcggg tggatcactt gaggtcagaa 

accccgtctc tactaaaaat acaaaaatta 

cagctactcg ggaggctgag acaggagact 

gagctgatca tcctgctgtt gcactccagc 

aaagaaggaa agatattatt cccatcatga 

gtaacctttc ctttcccgga cttgagcaac 

gttttaaagc aagataaagg tattggtttt 



tcttggatcc ctattttgaa cagtggtagt 60 
cctttttatg acagaacatg taataatgaa 120 
cacttgaatc agatggttgg aatcgagtac 180 
ttcatcattc ggaagcaaca gcggcagtcc 240 
tatatcattg ctggagtgat ctatcaggca 300 
gtgcttactg cagtgcatgg tattcagtca 360 
tatcatcctt ccaaagggta ttggtggcac 420 
agacctaaag ccaaaaggaa agaagaacca 480 
gctttacttt tagacctcag acaaaaattt 540 
gaaaagcctg ttccagtgga tcaaacaaag 600 
aaacctgagg agaaggagac cacaaagaat 660 
cctgaaaaac ggatgagact tcagtgagta 720 
catgctagtt atcatacctc agtactgtgg 780 
cttcttattt gtatggaatg cgcttatttt 840 
tcacgcctgt aatcccagca ctttgggagg 900 
gttcaagacc agcctgacca atatggtgaa 960 
gccgggcgtg gtggcgggcg cccatagtcc 1020 
tgcttgaacc cgggaggtgg aggttgccct 1080 
ttgggcgaaa gagcgagact ttgtctctat 1140 
tttcttgtga atatttgtaa tatgtttttt 1200 
ctacacactc acatgtttaa tggtagatat 1260 
aaaaaaaaaa aaaaaaaaaa aaaactcgag 1320 



<210> 23 
<211> 633 
<212> DNA 

<213> Homo sapiens 



<400> 23 

ctaagggcag tgaaggtgaa aaccctctca 
tgatgggggt taagccgggg gaggacgcat 
ctgagaaaga tactgcagct gttgtctcca 
atgtgcagat cacagaacca gaagctgagc 
tttcctctcc gagggctccc cagaccagag 
caatgaatgc cacggccacc aaggttgcta 
gtgagaactt gaacaatgag gtcatgatga 
catatgcgca gctgacccac gatgagctga 
taagcaagaa ggagttccag gtccgcgagc 
gggtcatgga agaaaccccc aatatcctcc 
gaaagatgta aattagcaga aaaaaaactc 

<210> 24 
<211> 1328 
<212> DNA 

<213> Homo sapiens 



cggtcccagg gagggagaag gaaggcatgc 60 
cggggcctgc tgaagacctt gtgagaagat 120 
gacagggcag ctccctgaac ctctttgaag 180 
cagagtccaa gtctgaaccg agacctccaa 240 
ctgtcaagcc ccgacttcat cctgtgaagc 300 
actgcagctt gggaactgcc accatcatcg 360 
agaaatacag cccctcggac cctgcatttg 420 
ttcagctggt cctcaaacag aaggaaacga 480 
tggaagacta cattgacaac ctgctcgtca 540 
gcatcccgac tcaggttggc aaaaaagcag 600 
gag 633 



<400> 24 

gtaaacgctc tcggaattat ggcggcggtg 

tgggttgaca gctcttggat ccctattttg 

gaaagaagta atccttttta tgacagaaca 

ctaacattag aacacttgaa tcagatggtt 

gagcccattc ttttcatcat tcggaagcaa 

ctagctgatt actatatcat tgctggagtg 

ataaactcta gagtgcttac tgcagtgcat 



gatatccgag acaatctgct gggaatttct 60 
aacagtggta gtgtcctgga ttacttttca 120 
tgtaataatg aagtggtcaa aatgcagagg 180 
ggaatcgagt acatcctttt gcatgctcaa 240 
cagcggcagt cccctgccca agttatccca 300 
atctatcagg caccagactt gggatcagtt 360 
ggtattcagt cagcttttga tgaagctatg 420 
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tattggtggc acttcaaaga tcatgaagag 480 
aaagaagaac caagctctat ttttcagaga 540 
agacaaaaaa tttccaccca aatttgtgca 600 
ataccagaaa ctgtaaaacc tgaggagaag 660 
agtgctaaag gcccccctga aaaacggatg 720 
gcctggaaga ctcctcatgc tagttatcat 780 
gtactttatt gtaaccttct tatttgtatg 840 
ggccggatgt ggtggctcac gcctgtaatc 900 
tcacttgagg tcagaagttc aagaccagcc 960 
aaaaatacaa aaattagccg ggcgtggtgg 1020 
gctgagacag gagacttgct tgaacccggg 1080 
gctgttgcac tccagcttgg gcgacagagc 1140 
attattccca tcatgatttc ttgtgaatat 1200 
cccggacttg agcaacctac acactcacat 1260 
taaaggtatt tgtataaaaa aaaaaaaaaa 1320 
1328 



tcatactgtc gatatcatcc ttccaaaggg 

caagataaag tcagacctaa agccaaaagg 

caacgtgtgg atgctttact tttagacctc 

gtggatcaaa caaagaaaga ggcagaacct 

gagaccacaa agaatgtaca acagacagtg 

agacttcagt gagtactgga caaaagagaa 

acctcagtac tgtggctctt gagctttgaa 

gaatgcgctt atttttttga aaggatatta 

ccagcacttt gggaggccat ggcgggtgga 

tgaccaatat ggtgaaaccc cgtctctact 

cgggcgccca tagtcccagc tactcgggag 

aggtggaggt tgccctgagc tgattatcat 

gagactttgt ctcaaaaaag aagaaaagat 

ttgtgatatg tcttctgtaa cctttcctct 

gtttactggt agatatgttt aaaagcaaaa 
aaactcga 

<210> 25 

<211> 1758 

<212> DNA 

<213> Homo sapiens 

<400> 25 

gttttttttt tttttttttt aaagagttgc 
tggagatgca gaatttggta tatttcaccc 
tgggtcagga tggctgggtg ccttctcccc 
ggcagggagc tagtaaaacc tcgcaatgac 
gatgaacttg gtcaatccgg agagtccagt 
gctgcccgag gcaactccac ccccattggc 
tatggaggac gaggcgattc ccgccgcagt 
atatccgaga caatctgctg ggaatttctt 
acagtggtag tgtcctggat tacttttcag 
gtaataatga agtggtcaaa atgcagaggc 
gaatcgagta catccttttg catgctcaag 
agcggcagtc ccctgcccaa gttatcccac 
tctatcaggc accagacttg ggatcagtta 
gtattcagtc agcttttgat gaagctatgt 
attggtggca cttcaaagat catgaagagc 
aagaagaacc aagctctatt tttcagagac 
gacaaaaatt tccacccaaa tttgtgcagc 
atcaaacaaa gaaagaggca gaacctatac 
ccacaaagaa tgtacaacag acagtgagtg 
ttcagtgagt actggacaaa agagaagcct 
cagtactgtg gctcttgagc tttgaagtac 
gcgcttattt tttgaaagga tattaggccg 
actttgggag gccatggcgg gtggatcact 
aatatggtga aaccccgtct ctactaaaaa 
gcccatagtc ccagctactc gggaggctga 
gaggttgccc tgagctgatt atcatgctgt 
tttgtctcaa aaaagaagaa aagatattat 
atatgtcttc tgttaccttt cctctcccgg 
ctggtagata tgtttaaaag caaataaagg 
aaaaaaaaaa aactcgag 

<210> 26 

<211> 493 

<212> DNA 

<213> Homo sapiens 



aacaattcat ctttatttct tattttcctc 60 
caagtatatt tgggatagtt ggctcctcgc 120 
tggcatggtt ctcttctctg cagggcgagg 180 
agccgcaatg gcagacccaa tggagcccag 240 
tgctcccagt gactgcagag tagccacaag 300 
aatggccgcc gcggacatca tcttggctgc 360 
gaagcccatg gcactgagtg gcggcggtgg 420 
gggttgacag ctcttggatc cctattttga 480 
aaagaagtaa tcctttttat gacagaacat 540 
taacattaga acacttgaat cagatggttg 600 
agcccattct tttcatcatt cggaagcaac 660 
tagctgatta ctatatcatt gctggagtga 720 
taaactctag agtgcttact gcagtgcatg 780 
catactgtcg atatcatcct tccaaagggt 840 
aagataaagt cagacctaaa gccaaaagga 900 
aacgtgtgga tgctttactt ttagacctca 960 
taaagcctgg agaaaagcct gttccagtgg 1020 
cagaaactgt aaaacctgag gagaaggaga 1080 
ctaaaggccc ccctgaaaaa cggatgagac 1140 
ggaagactcc tcatgctagt tatcatacct 1200 
tttattgtaa ccttcttatt tgtatggaat 1260 
gatgtggtgg ctcacgcctg taatcccagc 1320 
tgaggtcaga agttcaagac cagcctgacc 1380 
tacaaaaatt agccgggcgt ggtggcgggc 1440 
gacaggagac ttgcttgaac ccgggaggtg 1500 
tgcactccag cttgggcgac agagcgagac 1560 
tcccatcatg atttcttgtg aatatttgtt 1620 
aattgagcaa cctacacact cacatgttta 1680 
tattggtata tattgcttca aaaaaaaaaa 1740 
1758 
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<400> 26 

gaggcgagcg gcagggcctg gtggcgagag 
agctttccga gcggacgagc cggccgtgcc 
acacgtcgag ccccgcacag gcaagggtcc 
cagcgcagga gacgcccggc cgcgcgccgg 
gggtcggcgg ccgctctcct gccagcgtcg 
gcgcagccgc gaagattccg ttggaactga 
ccgtggacca caggccctcg ttgcccagat 
ccgtcttcgt catggtgggc ctcccccgcc 
tactcgctgc etc 

<210> 27 

<211> 1331 

<212> DNA 

<213> Homo sapiens 



cgcggctgtc actgcgcccg agcatcccag 60 
gggcatcccc agcctcgcta ccctcgcagc 120 
ggaacttagc ccaaagcacg tttcccctgg 180 
cgcacgcccc cctctcctcc tttgttccgg 240 
ggatctegge cccgggaggc gggcegtegg 300 
cgcagagccg agtgcagaag atctgggtgc 360 
cctgtgggcc aaagctgacc aactcccccg 420 
eggggcaaga cctacttctc cacgaaagct 480 
493 



<400> 27 

ggtggatatc cgagacaatc tgctgggaat 
tttgaacagt ggtagtgtcc tggattactt 
aacatgtaat aatgaagtgg teaaaatgea 
ggttggaatc gagtacatcc ttttgcatgc 
geaacagegg cagtcccctg cccaagttat 
agtgatctat caggcaccag acttgggatc 
gcatggtatt cagtcagctt ttgatgaagc 
agggtattgg tggcacttca aagatcatga 
aaggaaagaa gaaccaagct ctatttttca 
cctcagacaa aaatttccac ccaaatttgt 
agtggatcaa acaaagaaag aggcagaacc 
ggagaccaca aagaatgtac aacagacagt 
gagacttcag tgagtactgg acaaaagaga 
tacctcagta ctgtggctct tgagctttga 
ggaatgeget tattttttga aaggatatta 
ccagcacttt gggaggecat ggcgggtgga 
tgaccaatat ggtgaaaccc cgtctctact 
cgggcgccca tagtcccagc tactegggag 
aggtggaggt tgccctgagc tgattatcat 
gagactttgt ctcaaaaaaa gaagaaaaga 
tttgttatat gtcttctgta acctttcctc 
tgtttactgg tagatatgtt taaaagcaaa 
aaaaactcga g 

<210> 28 

<211> 1333 

<212> DNA 

<213> Homo sapiens 



ttcttgggtt gacagctctt ggatccctat 60 
ttcagaaaga agtaatcctt tttatgacag 120 
gaggctaaca ttagaacact tgaatcagat 180 
tcaagagccc attcttttca teatteggaa 240 
cccactagct gattactata teattgetgg 300 
agttataaac tctagagtgc ttactgcagt 360 
tatgtcatac tgtcgatatc atccttccaa 420 
agagcaagat aaagtcagac etaaagecaa 480 
gagacaacgt gtggatgctt tacttttaga 540 
gcagctaaag cctggagaaa agcctgttcc 600 
tataccagaa actgtaaaac ctgaggagaa 660 
gagtgctaaa ggcccccctg aaaaaeggat 720 
agcctggaag actcctcatg ctagttatca 780 
agtactttat tgtaaccttc ttatttgtat 840 
ggccggatgt ggtggctcac gectgtaate 900 
tcacttgagg tcagaagttc aagaccagcc 960 
aaaaatacaa aaattagccg ggcgtggtgg 1020 
gctgagacag gagacttget tgaacccggg 1080 
gctgttgcac tccagcttgg gegacagage 1140 
tattattccc atcatgattt cttgtgaata 1200 
tcccggactt gagcaaccta cacactcaca 1260 
ataaaggtat tggtataaaa aaaaaaaaaa 1320 
1331 



<400> 28 

cggcggtgga tatccgagac aatctgctgg 
ctattttgaa cagtggtagt gtcctggatt 
acagaacatg taataatgaa gtggtcaaaa 
agatggttgg aatcgagtac atecttttge 
ggaagcaaca gcggcagtcc cctgcccaag 
ctggagtgat ctatcaggca ccagacttgg 
cagtgcatgg tattcagtca gcttttgatg 
ccaaagggta ttggtggcac ttcaaagatc 
ccaaaaggaa agaagaacca agctctattt 
tagacctcag acaaaaattt ccacccaaat 



gaatttcttg ggttgacagc tcttggatcc 60 
acttttcaga aagaagtaat cctttttatg 120 
tgeagagget aacattagaa cacttgaatc 180 
atgetcaaga gcccattctt ttcatcattc 240 
ttatcccact agctgattac tatatcattg 300 
gatcagttat aaactctaga gtgcttactg 360 
aagctatgtc atactgtcga tatcatcctt 420 
atgaagagca agataaagtc agacctaaag 480 
ttcagagaca acgtgtggat gctttacttt 540 
ttgtgcagct aaagcctgga gaaaagcctg 600 
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ttccagtgga tcaaacaaag aaagaggcag aacctatacc agaaactgta aaacctgagg 660 
agaaggagac cacaaagaat gtacaacaga cagtgagtgc taaaggcccc cctgaaaaac 720 
ggatgagact tcagtgagta ctggacaaaa gagaagcctg gaagactcct catgctagtt 780 
atcatacctc agtactgtgg ctcttgagct ttgaagtact ttattgtaac cttcttattt 840 
gtatggaatg cgcttatttt ttgaaaggat attaggccgg atgtggtggc tcacgcctgt 900 
aatcccagca ctttgggagg ccatggcggg tggatcactt gaggtcagaa gttcaagacc 960 
agcctgacca atatggtgaa accccgtctc tactaaaaat acaaaaatta gccgggcgtg 1020 
gtggcgggcg cccatagtcc cagctactcg ggaggctgag acaggagact tgcttgaacc 1080 
cgggaggtgg aggttgccct gagctgatta tcatgctgtt gcactccagc ttgggcgaca 1140 
gagcgagact ttgtctcaaa aaagaagaaa agatattatt cccatcatga tttcttgtga 1200 
atatttgtga tatgtcttct gtaacctttc ctctcccgga cttgagcaac ctacacactc 1260 
acatgtttac tggtagatat gtttaaaagc aaaataaagg tatttgtata aaaaaaaaaa 1320 
aaaaaaactc gag 1333 

<210> 29 

<211> 813 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ctgagctgca cttcagcgaa ttcacctcgg ctgtggctga catgaagaac tccgtggcgg 60 
accgagacaa cagccccagc tcctgtgctg gcctcttcat tgcttcacac atcgggtttg 120 
actggcccgg ggtctgggtc cacctggaca tcgctgctcc agtgcatgct ggcgagcgag 180 
ccacaggctt tggggtggct ctcctactgg ctctttttgg ccgtgcctcc gaggacccgc 240 
tgctgaacct ggtatccccg ctggactgtg aggtggatgc ccaggaaggc gacaacatgg 300 
ggcgtgactc caagagacgg aggctcgtgt gagggctact tcccagctgg tgacacaggg 360 
ttccttacct cattttgcac tgactgattt taagcaattg aaagattaac taactcttaa 420 
gatgagtttg gcttctcctt ctgtgcccag tggtgacagg agtgagccat tcttctctta 480 
gaagcagctt aggggcttgg tggggtctgg agaaaattgt cacagacccc ataggtctcc 540 
atctgtaagc tctgtccctt gtcctccacc ctggtcttta gagccacctc aggtcaccct 600 
ctgtagtgag tgtacttcct gacccaggcc cttgctcaag ctggggctcc ctggggtgtc 660 
taaccagccc tgggtagatg tgactggctg ttagggaccc cattctgtga agcaggagac 720 
cctcacagct cccaccaacc cccagttcac ttgaagttga attaaatatg gccacaacat 780 
aaaaaaaaaa aaaaaaaaaa aaaaaaactc gag 813 

<210> 30 

<211> 1316 

<212> DNA 

<213> Homo sapiens 



<400> 30 

caggcgccca gtcatggccc aagagacagc 
cagtccaatc atagaaaaga tggaaaaaag 
gtggagtcaa atatactttt caccatcagg 
gtattcatca ggactggtgg agtgtgagac 
tgatgtcaaa tctgtaaaga aagagatctg 
taacaaagga ggcgccaccg tggggtgtga 
tgtctgtgcc aaaaaggacc aagcaattct 
attattttgc ccagaacatt ctccagaaca 
aagcatgaag aagaagagag gaaaaaacaa 
aaaaacgatg aaatgtagta acgccaaaag 
agatgcagct gtcaaatctc cttttcttaa 
actatttgaa cacatactag aaaatatgga 
tgcctcagag tcggactatg aagggatcga 
agacacacta agaaaattcc aagaagtaat 
gcaaaggcag atgaagcagc agcttgaggc 
atttcaagaa aatggggacc tggactgctc 
tgaggaccac cagtaaaagc tgttcctcag 



accaccgtgt ggcccagtct caaggggtga 60 
gacatgtgcc ctgtgccctg aaggccacga 120 
aaatatagtt gctcatgaaa actgtttgct 180 
tcttgatcta cgtaatacaa ttagaaactt 240 
gagaggaaga agattgaaat gctcattctg 300 
tttatggttc tgtaagaaga gttaccacta 360 
tcaagttgat ggaaaccatg gaacttacaa 420 
agaagaggcc actgaaagtg ctgatgaccc 480 
acgcctctca tcaggccctc ctgcacagcc 540 
acatatgaca gaagagcctc atggtcacac 600 
gaaatgccag gaagcaggac ttcttactga 660 
ttcagttcat ggaagacttg tggatgagac 720 
gaccttactg tttgactgtg gattatttaa 780 
caagagtaaa gcttgtgaat gggaagaaag 840 
acttgcagac ttacaacaaa gcttgtgctc 900 
aagttctaca tcaggatcct tgctacctcc 960 
gaaaactgga tggggcctcc atgttctcca 1020 
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aggatcgagg aagtcttcct gcctaccctg 
gctttgctca gccttaaatg gaatcttaga 
tggcctcatc atggtctcca ctcagtatta 
actgtgcatt gcacactgtt accatgggtt 
tttagcacac ttaataaatg cttgtcaaaa 

<210> 31 

<211> 1355 

<212> DNA 

<213> Homo sapiens 



cccaccccag tcaagggcag caacaccaga 1080 

gctttctctt gcttctgcta ctcctacaga 1140 

ataactccat cagcatagag caaactcaac 1200 

tatgctcact atcatatcac attgccaata 1260 

cccaaaaaaa aaaaaaaaaa ctcgag 1316 



<400> 31 

cggcggtgga tatccgagac aatctgctgg 
ctattttgaa cagtggtagt gtcctggatt 
acagaacatg taataatgaa gtggtcaaaa 
agatggttgg aatcgagtac atccttttgc 
ggaagcaaca gcggcagtcc cctgcccaag 
ctggagtgat ctatcaggca ccagacttgg 
cagtgcatgg tattcagtca gcttttgatg 
ccaaagggta ttggtggcac ttcaaagatc 
ccaaaaggaa agaagaacca agctctattt 
tagacctcag acaaaaattt ccacccaaat 
ttccagtgga tcaaacaaag aaagaggcag 
agaaggagac cacaaagaat gtacaacaga 
ggatgagact tcagtgagta ctggacaaaa 
atcatacctc agtactgtgg ctcttgagct 
gtatggaatg cgcttatttt ttgaaaggat 
aatcccagca ctttgggagg ccatggcggg 
agcctgacca atatggtgaa accccgtctc 
gtggcgggcg cccatagtcc cagctactcg 
cgggaggtgg aggttgccct gagctgatta 
gaacgagact ttgtctcaaa aaaagaagaa 
aatatttgtt atatgtcttc tggtaacctt 
actcacatgt ttactggtag atatgtttta 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaac 



gaatttcttg ggttgacagc tcttggatcc 60 
acttttcaga aagaagtaat cctttttatg 120 
tgcagaggct aacattagaa cacttgaatc 180 
atgctcaaga gcccattctt ttcatcattc 240 
ttatcccact agctgattac tatatcattg 300 
gatcagttat aaactctaga gtgcttactg 360 
aagctatgtc atactgtcga tatcatcctt 420 
atgaagagca agataaagtc agacctaaag 480 
ttcagagaca acgtgtggat gctttacttt 540 
ttgtgcagct aaagcctgga gaaaagcctg 600 
aacctatacc agaaactgta aaacctgagg 660 
cagtgagtgc taaaggcccc cctgaaaaac 720 
gagaagcctg gaagactcct catgctagtt 780 
ttgaagtact ttattgtaac cttcttattt 840 
attaggccgg atgtggtggc tcacgcctgt 900 
tggatcactt gaggtcagaa gttcaagacc 960 
tactaaaaat acaaaaatta gccgggcgtg 1020 
ggaggctgag acaggagact tgcttgaacc 1080 
tcatgctgtt gcactccagc ttgggcgaca 1140 
aagatattat tcccatcatg atttcttgtg 1200 
tcctctcccg gacttgaagc aacctcacac 1260 
aaagcaaaat aaaggtattt gtttttccaa 1320 
tcgag 1355 





<210> 


32 












<211> 


80 












<212> 


PRT 












<213> 


Homo sapien 








<400> 


32 










Val 


Ser Arg 


He 


Arg 


Gly Gly Ala 


Lys Lys Arg Lys Lys Lys Ser Tyr 


1 






5 






10 15 


Thr 


Thr Pro 


Lys 


Lys 


Asp 


Lys His 


Gin Arg Lys Lys Val Gin Pro Ala 






20 








25 30 


Val 


Leu Lys 


Tyr 


Tyr 


Lys 


Val Asp 


Glu Asn Gly Lys He Ser Cys Leu 




35 








40 


45 


Arg 


Arg Glu 


Cys 


Pro 


Ser 


Asp Glu 


Cys Gly Ala Gly Val Phe Met Ala 




50 








55 


60 


Ser 


His Phe 


Asp 


Arg 


His 


Tyr Cys 


Gly Lys Cys Cys Leu Thr His Cys 


65 








70 




75 80 




<210> 


33 












<211> 


130 












<212> 


PRT 











<213> Homo sapien 
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<400> 


33 










u e 


er 


Asn 


Glu 


Val 


Arg Lys Ph.© Arg Thr 


Leu Thr Glu Leu lie 








5 








T 

eu sp 


a 


Gin 


Glu 


His 


Va.1 Lys Asn Pro Tyr 


Lys Gly Lys Lys Leu 






20 










ys ys 




Pro 


Asp 


Phe 


Pro Lys Lys Pro Leu 


Thr Pro Tyr Phe Arg 
















Phe Phe 


Met 


Glu 


Lys 


Arg 


Aid. Lys Tyr Aid. Lys 


Leu His Pro Gin Met 














5^ m t _ 


Ser Asn 


Leu 


Asp Leu 


Thr 


Lys lie Leu Ser Lys 


Lys Tyr Lys Glu Leu 


65 








70 


75 


80 


Pro Glu 


Lys 


Lys 


Lys 


Met 


Lys Tyr Val Pro Asp 


Phe Gin Arg Arg Glu 








85 






- „, „ ^ 5 _ 


Thr Gly 


Val 


Arg Ala Lys 


Gl 11 Gl 


G-Ly Gly Ser Pro Pro 






100 






r ° Y 105 S n 




Pro Tyr 


Pro 


Glu 


Cys 


Gin 


Glu Ser Asp He Pro 


Glu Lys Pro Gin Asp 




115 








120 


125 


Pro Pro 














130 














<210> 


34 










<211> 


506 










<212> 


PRT 










<213> 


Homo sapien 






<400> 


34 










Asn Ser 


Glu 


Lys 


Glu 


He 


Pro Val Leu Asn Glu 


Leu Pro Val Pro Met 


1 






5 








Val Ala 


Arg 


Tyr 


He 


Arg 


He Asn Pro Gin Ser 


Tr Ph A Asn Gl 






20 








30^ n ^ 


Ser He 


Cys 


Met 


Arg 


Met 


Gl 11 L 5 Gl 






35 








U 40 S SU y ^ 


r ° 45" r ° SP r ° 


Asn Asn 


Tyr 


Tyr 


His 


Arg 


Ar A Gl M t Th 


Th Th A A 


50 










^rg sn u e r 


6Q r 37 SP SP eU 


Asp Phe 


Lys 


His 


His 


Asn 


Gl M t A 


Gin Leu Met Lys Val 


65 








70 


y ys u e ^rg 




Val Asn 


Glu 


Met 


Cys 


Pro 


Asn He Thr Arg He 


A n 11 Gl l° 








85 






yr Sn S 95 y yS 


Ser His 


Gin 


Gly 


Leu 


Lys 


Leu Tyr Ala Val Glu 


He Ser Asp His Pro 






100 






105 


110 


Gly Glu 


His 


Glu 


Val 


Gly 


Glu Pro Glu Phe His 


Tyr He Ala Gly Ala 




115 








120 




His Gly 


Asn 


Glu 


Val 


Leu 


Gly Arg Glu Leu Leu 


L 25 


130 










135 


140 SU 8U SU 1S 


Phe Leu 


Cys 


Gin 


Glu 


Tyr 


Ser Ala Gin Asn Ala 


A 11 V 1 A 
rg e a rg eu 


145 








150 






Val Glu 


Glu 


Thr 


Arg 


He 


His 11 i? 55 


A 








165 




6 SU 170 Sr 


eu sn ro ^sp y 


Tyr Glu 


Lys 


Ala 


Tyr 


Glu 


Gly Gly Ser Glu Leu 


Gly Gly Trp Ser Leu 






180 






185 


190 


Gly Arg 


Trp 


Thr 


His 


Asp 


Gly He Asp He Asn 


Asn Asn Phe Pro Asp 




195 








200 


205 


Leu Asn 


Ser 


Leu 


Leu 


Trp 


Glu Ala Glu Asp Gin 


Gin Asn Ala Pro Arg 


210 










215 


220 


Lys Val 


Pro 


Asn 


His 


Tyr 


He Ala He Pro Glu 


Trp Phe Leu Ser Glu 


225 








230 


235 


240 


Asn Ala 


Thr 


Val 


Ala 


Thr 


Glu Thr Arg Ala Val 


He Ala Trp Met Glu 
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245 


250 


255 




Lys 


lie 


Pro 


Phe 


Val Leu Gly 


Gly Asn Leu Gin Gly Gly 


Glu Leu 


Val 








260 




265 


270 




Val 


Ala 


Tyr 


Pro 


Tyr Asp Met 


Val Arg Ser Leu Trp Lys 


Thr Gin 


Glu 






275 






280 285 






His 


Thr 


Pro 


Thr 


Pro Asp Asp 


His Val Phe Arg Trp Leu 


Ala Tyr 


Ser 




290 






295 


300 




Tyr 


Ala 


Ser 


Thr 


His Arg Leu 


Met Thr Asp Ala Arg Arg 


Arg Val 


Cys 


305 








310 


315 




320 


His 


Thr 


Glu 


Asp 


Phe Gin Lys 


Glu Glu Gly Thr Val Asn 


Gly Ala 












325 


330 


335 




Trp 


His 


Thr 


Val 


Ala Gly Ser 


Leu Asn Asp Phe Ser Tyr 


Leu His 


Thr 








340 




345 


350 




Asn 


Cys 


Phe 


Glu 


Leu Ser He 


Tyr Val Gly Cys Asp Lys 


Tyr Pro 


His 






355 






360 365 






Glu 


Ser 


Glu 


Leu 


Pro Glu Glu 


Trp Glu Asn Asn Arg Glu 


Ser Leu 


He 




370 






375 


380 






Val 


Phe 


Met 


Glu 


Gin Val His 


Arg Gly He Lys Gly He 


Val Arg Asp 


385 








390 


395 




400 




Gin 


Gly 


Lys 


Gly He Ser 


Asn Ala Val He Ser Val 


Glu Gly Val 










405 


410 


415 




Asn 


His 


Asp 


He 


Arg Thr Ala 


Ser Asp Gly Asp Tyr Trp 


Arg Leu Leu 








420 




425 


430 




Asn 


Pro 


Gly 


Glu 


Tyr Val Val 


Thr Ala Lys Ala Glu Gly 


Phe He 


Thr 






435 






440 445 






Ser 


Thr 


Lys 


Asn 


Cys Met Val 


Gly Tyr Asp Met Gly Ala 


Thr Arg 


Cys 




450 






455 


460 






Asp 


Phe 


Thr 


Leu 


Thr Lys Thr 


Asn Leu Ala. Arg lie Ajtcj 


Glu He 


Met 


465 








470 


475 




480 


Glu 


Thr 


Phe 


Gly 


Lys Gin Pro 


Val S@3T Leu Piro Sesr Arcj 


Arg Leu Lys 










485 


490 


495 




Leu 


Arg 


Gly 


Arg 


Lys Arg Arg 


Gin Arg Gly 












500 




505 








<210> 


35 












<211> 


96 












<212> 


PRT 












<213> 


Homo sapien 










<400> 


35 










Met 


Asn 


Gly 


Glu 


Ala Asp Cys 


Pro Thr Asp Leu Glu Met 


Ala Ala 


Pro 


1 








5 


10 


15 




Arg 


Gly 


Gin 


Asp 


Arg Trp Ser 


Gin Glu Asp Met Leu Thr 


Leu Leu Glu 








20 




25 


30 




Cys 


Met 


Lys 


Asn 


Asn Leu Pro 


Ser Asn Asp Ser Ser Gin 


Phe Lys 


Thr 






35 






40 45 






Thr 


Gin 


Thr 


His 


Met Asp Arg 


Glu Lys Val Ala Leu Lys 


Asp Phe 


Ser 




50 






55 


60 






Gly 


Asp 


Met 


Cys 


Lys Leu Lys 


Trp Val Glu He Ser Asn 


Glu Val 


Arg 


65 








70 


75 




80 


Lys 


Phe 


Arg 


Thr 


Leu Thr Glu 


Leu He Leu Asp Thr Gin 


Glu His 


Val 










85 


90 


95 






<210> 


36 












<211> 


129 












<212> 


PRT 












<213> 


Homo sapien 
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<400> 


36 






Gly 


e 


Val 


Val Phe Ser 


Leu Gly Ser Met Val Ser Glu 


lie Pro Glu 








5 


10 




Lys 


Lys 


Ala 


Val Ala He 


Ala Asp Ala Leu Gly Lys He 


Gl Th 














Val 


eu 


Trp 


Arg lyr inr 


Gly Thr Arg Pro Ser Asn Leu 


Ala Asn A n 
a sn sn 






35 




40 45 




Thr 




Leu 


Val Gin Trp 


Leu Pro Gin Asn Asp Leu Leu 


Gly His Pro 














Met 


Thr 


Arg 


Ala Phe He 


Thr His Ala Ser Ser His Gly 


Val Asn Glu 


65 






70 


75 




Ser 


He 


Cys 


Asn Gly Val 


Pro Met Val Met He Pro Leu 


Phe Gly Asp 








85 


90 


95 


Gin 


Met 


Asp 


Asn Ala Lys 


Arg Arg Glu Thr Lys Gly Ala 


Gly Val Thr 








100 


105 


110 


Leu 


Asn 


Val 


Leu Glu Met 


Thr Ser Glu Asp Leu Glu Asp 


Ala Leu Lys 






115 




120 125 




Ser 

















37 












<211> 


238 














PRT 












<213> 


Homo sapien 








< > 


37 










sn 


Leu Leu 


Gly 


He 


Ser 


Trp Val Asp Ser Ser Trp He 


Pro He Leu 








5 








Asn 


Ser Gly 


Ser 


Val 


Leu 


Asp Tyr Phe Ser Glu Arg Ser 


Asn Pro Phe 






20 






25 


30 


Tyr 


Asp Arg 


Thr 


Cys 


Asn 


Asn Glu Val Val Lys Met Gin 


Arg Leu Thr 




35 








40 45 




Leu 


Glu His 


Leu 


Asn 


Gin 


Met Val Gly He Glu Tyr He 


Leu Leu His 




50 








55 60 




Ala 


Gin Glu 


Pro 


He 


Leu 


Phe He He Arg Lys Gin Gin 


Arg Gin Ser 


65 








70 


75 


80 


Pro 


Ala Gin 


Val 


He 


Pro 


Leu Ala Asp Tyr Tyr He He 


Ala Gly Val 








85 




90 


95 


He 


Tyr Gin 


Ala 


Pro 


Asp 


Leu Gly Ser Val He Asn Ser 


Arg Val Leu 






100 






105 


110 


Thr 


Ala Val 


His 


Gly 


He 


Gin Ser Ala Phe Asp Glu Ala 


Met Ser Tyr 




115 








120 125 




Cys 


Arg Tyr 


His 


Pro 


Ser 


Lys Gly Tyr Trp Trp His Phe 


Lys Asp His 




130 








135 140 




Glu 


Glu Gin 


Asp 


Lys 


Val 


Arg Pro Lys Ala Lys Arg Lys 


Glu Glu Pro 


145 








150 


155 


160 


Ser 


Ser He 


Phe 


Gin 


Arg 


Gin Arg Val Asp Ala Leu Leu 


Leu Asp Leu 








165 




170 


175 


Arg 


Gin Lys 


Phe 


Pro 


Pro 


Lys Phe Val Gin Leu Lys Pro 


Gly Glu Lys 






180 






185 


190 


Pro 


Val Pro 


Val 


Asp 


Gin 


Thr Lys Lys Glu Ala Glu Pro 


He Pro Glu 




195 








200 205 




Thr 


Val Lys 


Pro 


Glu 


Glu 


Lys Glu Thr Thr Lys Asn Val 


Gin Gin Thr 




210 








215 220 




Val 


Ser Ala 


Lys 


Gly 


Pro 


Pro Glu Lys Arg Met Arg Leu 


Gin 


225 








230 


235 





<210> 38 
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<211> 


202 












<212> 


PRT 












<213> 


Homo sapien 








<400> 
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ys 


Gly 


Ser 


Glu 


Gly 
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Asn Pro Leu Tnr Val 


Pro Gly Arg Glu Lys 










5 








Glu 
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Gly 
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Asp Ala Ser Gly Pro 








20 










Ala 


Glu 


Asp 


Leu 


Val 
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Arg Ser Glu Lys Asp 


Thr Ala Ala Val Val 






35 












er 




G n 


Gly 


Ser 


Ser 


Leu Asn Leu Pne Glu 


Asp Val Gin He Thr 
















fP 


Glu 


ro 


u 


Ala 


Glu 


Pro 


Glu Ser Lys Ser Glu 


Pro Arg Pro Pro He 
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er 
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Pro 
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eu 
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Thr 
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Thr 


He 




Asn Asn Glu Val Met 
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6 ^ u sn eu 




Met 


Lys 


Lys 


Tyr 


Ser 


Pro 


Ser As Pro Ala Ph 


Ala Tyr Ala Gin Leu 
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^er sp ro a e 




Thr 


His 
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Glu 
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He 
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Gin Lys Glu Thr He 


145 










150 
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Ser 


Lys 


Lys 


Glu 


Phe 


Gin 


Val Arg Glu Leu Glu 


Asp Tyr He Asp Asn 
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Leu 


Leu 


Val 


Arg 


Val 


Met 


Glu Glu Thr Pro Asn 


He Leu Arg He Pro 
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Val 
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Lys 
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Val 
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Ser 


Val 
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Asp 


Tyr 


Pne Ser Glu Arg Ser 
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Glu 


Val Val Lys Met Gin 


Arg Leu Thr Leu Glu 




50 












5° ■ 


His 


Leu 


Asn 


Gin 


Met 


Val 


r<l Tl rti m 

Gly 1x6 Glu Tyr He 


Leu Leu His Ala Gin 


65 










70 






Glu 


Pro 


He 


Leu 


Phe 


He 


He Arg Lys Gin Gin 


Arg Gin Ser Pro Ala 










85 








Gin 


Val 


He 


Pro 


Leu 


Ala 


Asp Tyr Tyr He He 


Ala Gly Val He Tyr 








100 






105 


110 


Gin 


Ala 


Pro 


Asp 


Leu 


Gly 


Ser Val He Asn Ser 


Arg Val Leu Thr Ala 






115 








120 


125 


Val 


His 


Gly 


He 


Gin 


Ser 


Ala Phe Asp Glu Ala 


Met Ser Tyr Cys Arg 




130 










135 


140 


Tyr 


His 


Pro 


Ser 


Lys 


Gly 


Tyr Trp Trp His Phe 


Lys Asp His Glu Glu 


145 










150 


155 


160 
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Gin 


Asp 


Lys 


Val 




Lys 


Ala 


Lys 














165 








170 


175 


lie 


Phe 


Gin 


Arg 


Gin Arg 


Val 


ASD 


Ala 












180 








185 




190 


Lys 


He 


Ser 


Thr 


Gin He 


Cys 


Ala 


Val 


Asp 


Gin Thr Lys Lys Glu Ala 






195 








200 






205 


Glu 


Pro 


lie 


Pro 


Glu Thr 


Val 


Lys 


Pro 


Glu 


Glu Lys Glu Thr Thr Lys 




210 








215 








220 


Asn 


Val 


Gin 


Gin 


Thr Val 


Ser 


Ala 


Lys 


Gly 


Pro Pro Glu Lys Arg Met 


225 








230 










235 240 


Arg 


Leu 


Gin 

















<210> 40 
<211> 245 
<212> PRT 
<213> Homo sapien 

<400> 40 



Ala 


Ala 


Val 


Asp 




Arg 


sp 


sn 


eu 




Gly Tie Ser Trp Val Asp 




















1Q U 




Ser 


Ser 


T 

rp 


He 


Pro 


He 


Leu 
eu 


Asn 
sn 


Ser 


Gl 


Ser Val Le A T § Phe 








20 














er a eu ^sp yr e 


Ser 


Glu 


Arg 




Asn 


Pro 


Phe 


Tyr 


sp 


rg 


Thr Cys Asn Asn Glu Val 






35 










40 








Val 


Lys 


Met 


Gin 


Arg 


Leu 


Thr 


Leu 


Glu 


His 


Leu Asn Gin Met Val Gly 




50 










55 








60 


He 


Glu 


Tyr 


He 


Leu 


Leu 


His 


Ala 


Gin 


Glu 


Pro He Leu Phe He He 


65 










70 










75 80 


Arg 


Lys 


Gin 


Gin 


Arg 


Gin 


Ser 


Pro 


Ala 


Gin 


Val lie Pro Leu Ala Asp 










85 










90 


95 


Tyr 


Tyr 


He 


He 


Ala 


Gly 


Val 


He 


Tyr 


Gin 


Ala Pro Asp Leu Gly Ser 








100 










105 




110 


Val 


He 


Asn 


Ser 


Arg 


Val 


Leu 


Thr 


Ala 


Val 


His Gly He Gin Ser Ala 






115 










120 






125 


Phe 


Asp 


Glu 


Ala 


Met 


Ser 


Tyr 


Cys 


Arg 


Tyr 


His Pro Ser Lys Gly Tyr 




130 










135 








140 


Trp 


Trp 


His 


Phe 


Lys 


Asp 


His 


Glu 


Glu 


Gin 


Asp Lys Val Arg Pro Lys 


145 










150 










155 160 


Ala 


Lys 


Arg 


Lys 


Glu 


Glu 


Pro 


Ser 


Ser 


He 


Phe Gin Arg Gin Arg Val 










165 










170 


175 


Asp 


Ala 


Leu 


Leu 


Leu 


Asp 


Leu 


Arg 


Gin 


Lys 


Phe Pro Pro Lys Phe Val 








180 










185 




190 


Gin 


Leu 


Lys 


Pro 


Gly 


Glu 


Lys 


Pro 


Val 


Pro 


Val Asp Gin Thr Lys Lys 






195 










200 






205 


Glu 


Ala 


Glu 


Pro 


He 


Pro 


Glu 


Thr 


Val 


Lys 


Pro Glu Glu Lys Glu Thr 




210 










215 








220 


Thr 


Lys 


Asn 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Ala 


Lys Gly Pro Pro Glu Lys 


225 










230 










235 240 


Arg 


Met 


Arg 


Leu 


Gin 






















245 















<210> 41 
<211> 163 
<212> PRT 
<213> Homo sapien 



<400> 41 
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Gly 


Glu 


Arg 


Gin 


Gly Leu 


Val 


Ala 


A 


Ala Arg 


Leu Ser Leu Ar Pro 
eu r eu rg ro 


1 








5 










10 






lie 


Pro 


Glu 


Leu 


Ser 


Glu 
U 


Arg 


Thr 


Ser Arg 


Pro Cys Arg Ala Ser 








20 










25 






Pro 


Ala 


Ser 




Pro 


Ser 


Gin 
n 


His 


Thr 


Ser 
er er 


Pro Ala Gin Ala Arg 






35 


















Val 


Arg 


Asn 


Leu 


Ala 


Gin 


Ser 


Thr 


Phe 


Pro Leu 


Ala Ala Gin Glu Thr 




50 










55 










Pro 


Gly 


A 


Ala 


Pro 


Ala 


Hi 
is 


Ala 


ro 


eu er 


Ser Pne Val Pro Gly 


65 










70 












Val 


Gly 


Glv 


Ar 


Ser 


Pro 


Ala 


Ser 


Val 


Gl He 


Ser AXa Pro Gly Gly 










85 














n 


Pro 


Ser 
er 


Gly 


Ala 


Ala 


a 


ys 




L 

ro eu 


Glu Leu Thr Gin Ser 








100 










105 






Arg 


Val 


Gin 


Lys 


He 


Trp 


Val 


Pro 


Val 


Asp His 


Arg Pro Ser Leu Pro 






115 










120 






125 


Arg 


Ser 


Cys 


Gly 


Pro 


Lys 


Leu 


Thr 


Asn 


Ser Pro 


Ala Val Phe Val Met 




130 










135 








140 


Val 


Gly 


Leu 


Pro 


Arg Pro 


Gly 


Gin 


Asp 


Leu Leu 


Leu His Glu Ser Leu 


145 










150 








155 


160 


Leu 


Ala 


Ala 



















<210> 42 

<211> 243 

<212> PRT 

<213> Homo sapien 



<400> 42 



Val 


Asp 


He 


Arg 


Asp 


Asn 


Leu 


Leu 


Gly 


He 


Ser 


Trp Val Asp Ser Ser 


1 








5 










10 




15 


Trp 


lie 


Pro 


He 


Leu 


Asn 


Ser 


Gly 


Ser 


Val 


Leu 


Asp Tyr Phe Ser Glu 








20 










25 






30 


Arg 


Ser 


Asn 


Pro 


Phe 


Tyr 


Asp 


Arg 


Thr 


Cys 


Asn 


Asn Glu Val Val Lys 






35 










40 








45 


Met 


Gin 


Arg 


Leu 


Thr 


Leu 


Glu 


His 


Leu 


Asn 


Gin 


Met Val Gly He Glu 




50 










55 










60 


Tyr 


He 


Leu 


Leu 


His 


Ala 


Gin 


Glu 


Pro 


He 


Leu 


Phe He He Arg Lys 


65 










70 










75 


80 


Gin 


Gin 


Arg 


Gin 


Ser 


Pro 


Ala 


Gin 


Val 


He 


Pro 


Leu Ala Asp Tyr Tyr 










85 










90 




95 


He 


He 


Ala 


Gly 


Val 


He 


Tyr 


Gin 


Ala 


Pro 


Asp 


Leu Gly Ser Val He 








100 










105 






110 


Asn 


Ser 


Arg 


Val 


Leu 


Thr 


Ala 


Val 


His 


Gly 


He 


Gin Ser Ala Phe Asp 






115 










120 








125 


Glu 


Ala 


Met 


Ser 


Tyr 


Cys 


Arg 


Tyr 


His 


Pro 


Ser 


Lys Gly Tyr Trp Trp 




130 










135 










140 


His 


Phe 


Lys 


Asp 


His 


Glu 


Glu 


Gin 


Asp 


Lys 


Val 


Arg Pro Lys Ala Lys 


145 










150 










155 


160 


Arg 


Lys 


Glu 


Glu 


Pro 


Ser 


Ser 


He 


Phe 


Gin Arg 


Gin Arg Val Asp Ala 










165 










170 




175 


Leu 


Leu 


Leu 


Asp 


Leu 


Arg 


Gin 


Lys 


Phe 


Pro 


Pro 


Lys Phe Val Gin Leu 








180 










185 






190 


Lys 


Pro 


Gly 


Glu 


Lys 


Pro 


Val 


Pro 


Val 


Asp 


Gin 


Thr Lys Lys Glu Ala 






195 










200 








205 


Glu 


Pro 


He 


Pro 


Glu 


Thr 


Val 


Lys 


Pro 


Glu 


Glu 


Lys Glu Thr Thr Lys 




210 










215 










220 


Asn 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Ala 


Lys 


Gly Pro 


Pro Glu Lys Arg Met 
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225 230 235 240 

Arg Leu Gin 



<210> 43 

<211> 244 

<212> PRT 

<213> Homo sapien 



<400> 43 



Ala 


Val 


ASD 


He 


Arg 


ASD 


Asn 


Leu 


Leu 


Gly He 


Ser 


Tr 

P 


Val 


Asp Ser 


1 








5 










10 








15 




Tr 

P 


He 


Pro 


He 


Leu 


Asn 


Ser 


Gly 




Leu 


sp 


Tyr 










20 










25 








30 




Glu 


rg 


Ser 




Pro 


Phe 


T 


Asp 


Arg 


Thr C s 
ys 


Asn 


Asn 


Glu 


Val Val 






35 










40 








45 






Lvs 


Met 




rg 


Leu 


Thr 


Leu 




His 


Leu Asn 


Gin 




Val 


Gl He 
y e 




50 










55 








60 






Glu 


Tyr 


He 


Leu 
eu 


Leu 


His 


Ala 


Gin 


Gl 

U 


Pro 11 


Leu 


Phe 


11 

e 


11 A 


65 










70 








r ° 75 S 










L 


Gin 


Gin 


rg 


Gin 


Ser 


Pro 


Ala 


Gin 


Val He 


Pro 


Leu 
eu 


Ala 


Asp Tyr 










85 










90 








95 


Tyr 


lie 


He 


Ala 




Val 


He 


Tyr 


Gin 




Asp 


Leu 


Gly 










100 










105 








110 




lie 








Val 


















Ala Phe 






115 










120 








125 






Asp 


Glu 


Ala 


Met 


Ser 


Tyr 




i\rg 


Tyr 


His Pro 


^40 


Lys 


Gly 


Tyr Trp 




130 


























Trp 


His 


Phe 


L S 
ys 


Asp 


His 


Glu 


Glu 


Gin 


Asp Lys 


Val 


Arg 


Pro 


Lys AJ.a 


145 










150 








155 








160 


Lys 


Arg 


Lys 


Glu 


Glu 


Pro 


Ser 


Ser 


He 


Phe Gin 


Arg 


Gin 


Arg 


Val Asp 










165 




















Ala 


Leu 


Leu 


Leu 


Asp 


Leu 


rg 


Gin 


Lys 


Phe Pro 
e ro 


Pro 
ro 


ys 


Ph 


V 1 Gl 

a n 








180 










185 








190 




Leu 


Lys 


Pro 


Gly 


Glu 


Lys 


Pro 


Val 


Pro 


Val As 


Gin 


Thr 


Y 


Lys Glu 






195 


















205 






Ala 


Glu 


Pro 


He 


Pro 


Glu 


Thr 


Val 


_ 


Pro Gl 

U 


Glu 


ys 


Gl 

U 


Th Th 
r r 




210 










215 








220 








Lys 


Asn 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Ala 


Lys Gly 


Pro 


Pro 


Glu 


Lys Arg 


225 










230 








235 










Met 


Arg 


Leu 


Gin 
























<210> 


44 
























<211> 


109 
























<212> 


PRT 
























<213> 


Home 


> sapien 




















<400> 


44 






















Glu 


Leu 


His 


Phe 


Ser 


Glu 


Phe 


Thr 


Ser 


Ala Val 


Ala 


Asp 


Met 


Lys Asn 


1 








5 










10 








15 


Ser 


Val 


Ala 


Asp 


Arg Asp 


Asn 


Ser 


Pro 


Ser Ser 


Cys 


Ala 


Gly 


Leu Phe 








20 










25 








30 




lie 


Ala 


Ser 


His 


He Gly 


Phe 


Asp 


Trp 


Pro Gly 


Val 


Trp 


Val 


His Leu 






35 










40 








45 






Asp 


lie Ala 


Ala 


Pro 


Val 


His 


Ala 


Gly 


Glu Arg 


Ala 


Thr 


Gly 


Phe Gly 




50 










55 








60 








Val 


Ala Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Gly 


Arg Ala 


Ser 


Glu 


Asp 


Pro Leu 
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65 










70 


75 






Asn 


Leu 


Val 


Ser Pro Leu Asp 


Cys Glu Val 


Asp Ala. Gin Glu. Gly 










85 




90 


95 


Asp 


Asn 


Met 


Gly 


Arg Asp Ser Lys 


Arg Arg Arg 


Leu Val 








100 






105 






<210> 


45 












<211> 


324 












<212> 


PRT 












<213> 


Homo sapien 








<400> 


45 










Arg 


Arg 


Pro 


Val 


Met 


Ala Gin Glu 


Thr Ala Pro 


Pro Cys Gly Pro Val 


1 








5 




10 


15 


Ser 


Arg 


Gly 


Asp 


Ser 


Pro He He 


Glu Lys Met 


Glu Lys Arg Thr Cys 








20 






25 


30 


Ala 


Leu 


Cys 


Pro 


Glu 


Gly His Glu 


Trp Ser Gin 


He Tyr Phe Ser Pro 






35 






40 




45 


Ser 


Gly Asn 


He 


Val 


Ala His Glu 


Asn Cys Leu 


Leu Tyr Ser Ser Gly 












55 




60 


Leu 


a 


U 


Cys 


Glu 


Thr Leu Asp 


Leu Arg Asn 


Thr He Arg Asn Phe 


65 










70 


75 


80 


Asp 


a 


ys 


Ser 


Val 


Lys Lys Glu 


lie Trp Arg 


Gly Arg Arg Leu Lys 










85 




90 


95 


C 


Ser 


Phe 


Cys 


Asn 


Lys Gly Gly 


Ala Thr Val 










100 






105 


110 


Phe 


Cys 


Lys 


Lys 


Ser 


Tyr His Tyr 


Val Cys Ala 


Lys Lys Asp Gin Ala 






115 






120 




125 


lie 


Leu 


Gin 


Val 


Asp 


Gly Asn His 


Gly Thr Tyr 






130 








135 




140 


Glu 


His 


Ser 


Pro 


Glu 


Gin Glu Glu 


Ala Thr Glu 


Ser Ala Asp Asp Pro 


145 










150 


155 


160 




Met 


Lys 


Lys 


Lys 


Arg Gly Lys 


Asn Lys Arg 












165 




170 


175 


Pro 


Ala 


Gin 


Pro 


Lys 


Thr Met Lys 


Cys Ser Asn 










180 






185 


190 


Thr 


Glu 


Glu 


Pro 


His 


Gly His Thr 


Asp Ala Ala 


Val Lys Ser Pro Phe 






195 






200 




205 


Leu 


Lys 


Lys 


Cys 


Gin 


Glu Ala Gly 


Leu Leu Thr 






210 








215 




220 


lie 


Leu 


Glu 


Asn 


Met 


Asp Ser Val 


His Gly Arg 


Leu Val Asp Glu Thr 


225 










230 


235 


240 


Ala 


Ser 


Glu 


Ser 


Asp 


Tyr Glu Gly 


He Glu Thr 


Leu Leu Phe Asp Cys 










245 




250 


255 


Gly 


Leu 


Phe 


Lys 


Asp 


Thr Leu Arg 


Lys Phe Gin 


Glu Val He Lys Ser 








260 






265 


270 


Lys 


Ala 


Cys 


Glu 


Trp 


Glu Glu Arg 


Gin Arg Gin 


Met Lys Gin Gin Leu 






275 






280 




285 


Glu 


Ala 


Leu 


Ala 


Asp 


Leu Gin Gin 


Ser Leu Cys 


Ser Phe Gin Glu Asn 




290 








295 




300 


Gly 


Asp Leu 


Asp 


Cys 


Ser Ser Ser 


Thr Ser Gly 


Ser Leu Leu Pro Pro 


305 










310 


315 


320 


Glu 


Asp 


His 


Gin 











<210> 46 
<211> 244 
<212> PRT 
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<213> Homo sapien 





<400> 


46 


















Ala 


Val 


sp 


He 




sp 


Asn 


Leu 
eu 


eu 


Gl He 
1fl y 6 


ber Trp Val Asp 


er 


1 


























Trp 


e 


Pro 


He 


Leu 


Asn 


er 




Ser Val 
er a 


Leu Asp Tyr Phe 


er 








20 


















Glu 


rg 






Pro 


Phe 


Tyr 




Arg 


Thr C s 
r ys 


Asn Asn Glu Val 


a 
















40 ? 










L 


Met 


Gin 


A 


Leu 


Thr 


Leu 


Glu 


His 


Leu Asn 


Gin Met Val Gly 


11 

e 




50 










55 








60 




Glu 


lyr 


Tl 


Leu 


Leu 


His 


Ala 


Gin 


Glu 


Pro He 


Leu Phe He He 


Arg 


65 










70 








75 






Lys 


Gin 
n 


Gin 


Arg 


Gin 


Ser 


Pro 


Ala 


Gin 




Pro Leu Ala A 


Tyr 










85 










90 


u a ^sp 




Tvr 


lie 


He 


Ala 


Gly 


Val 


He 


r 


Gin 


Ala Pro 


_ L Gl S 


V 1 








100 










105 




P eU no Sr 




lie 


sn 




Arg 


Val 


Leu 


Thr 


Ala 


Val 


His Glv 


He Gin Ser Ala 


Ph 
















120 






/-I m 




As 

P 


tn 


a 


Met 


Ser 
er 


Tyr Cys 


Arg 


lyr 


His Pro 


ber Lys Gly Tyr 


Trp 














135 












Trp 


is 


Phe 


L 


A 

sp 


His 


Glu 


Glu 
U 


Gin 


Asp Lys 


Vax Arg Pro Lys 




145 










150 








155 




fin 


Lys 


Arg 


Lys 


Glu 


Glu 


Pro 


Ser 


Ser 


He 




Arg Gin Arg Val 


sp 










165 










170 


175 




Ala 


Leu 


Leu 


Leu 


Asp 


Leu Arg 


Gin 


Lys 


Phe Pro 


Pro Lys Phe Val 


Gin 








180 










185 




190 




Leu 


Lys 


Pro 


Gly 


Glu 


Lys 


Pro 


Val 


Pro 


Val Asp 


Gin Thr Lys Lys 


Glu 






195 










200 






205 




Ala 


Glu 


Pro 


He 


Pro 


Glu 


Thr 


Val 


Lys 


Pro Glu 


Glu Lys Glu Thr 


Thr 




210 










215 








220 




Lys 


Asn 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Ala 


Lys Gly 


Pro Pro Glu Lys 


Arg 


225 










230 








235 




240 


Met 


Arg 


Leu 


Gin 



















<210> 47 

<211> 14 

<212> DNA 

<213> Homo sapien 



<400> 47 
tttttttttt ttag 



<210> 48 

<211> 10 

<212> DNA 

<213> Homo sapien 



<400> 48 
cttcaacctc 



<210> 49 

<211> 496 

<212> DNA 

<213> Homo sapien 



<400> 49 
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gcaccatgta ccgagcactt cggctcctcg cgcgctcgcg tcccctcgtg cgggctccag 60 

ccgcagcctt agcttcggct cccggcttgg gtggcgcggc cgtgccctcg ttttggcctc 120 

cgaacgcggc tcgaatggca agccaaaatt ccttccggat agaatatgat acctttggtg 180 

aactaaaggt gccaaatgat aagtattatg gcgcccagac cgtgagatct acgatgaact 240 

ttaagattgg aggtgtgaca gaacgcatgc caaccccagt tattaaagct tttggcatct 300 

tgaagcgagc ggccgctgaa gtaaaccagg attatggtct tgatccaaag attgctaatg 360 

caataatgaa ggcagcagat gaggtagctg aaggtaaatt aaatgatcat tttcctctcg 420 

tggtatggca gactggatca ggaactcaga caaatatgaa tgtaaatgaa gtcattagcc 480 

aatagagcaa ttgaaa 496 

<210> 50 

<211> 499 

<212> DNA 

<213> Homo sapien 

<400> 50 

agaaaaagtc tatgtttgca gaaatacaga tccaagacaa agacaggatg ggcactgctg 60 

gaaaagttat taaatgcaaa gcagctgtgc tttgggagca gaagcaaccc ttctccattg 120 

aggaaataga agttgcccca ccaaagacta aagaagttcg cattaagatt ttggccacag 180 

gaatctgtcg cacagatgac catgtgataa aaggaacaat ggtgtccaag tttccagtga 240 

ttgtgggaca tgaggcaact gggattgtag agagcattgg agaaggagtg actacagtga 300 

aaccaggtga caaagtcatc cctctctttc tgccacaatg tagagaatgc aatgcttgtc 360 

gcaacccaga tggcaacctt tgcattagga gcgatattac tggtcgtgga gtactggctg 420 

atggcaccac cagatttaca tgcaagggcg aaccagtcca ccacttcatg aacaccagta 480 

catttaccga gtacacagt 499 

<210> 51 

<211> 887 

<212> DNA 

<213> Homo sapien 

<400> 51 

gagtctgagc agaaaggaaa agcagccttg gcagccacgt tagaggaata caaagccaca 60 

gtggccagtg accagataga gatgaatcgc ctgaaggctc agctggagaa tgaaaagcag 120 

aaagtggcag agctgtattc tatccataac tctggagaca aatctgatat tcaggacctc 180 

ctggagagtg tcaggctgga caaagaaaaa gcagagactt tggctagtag cttgcaggaa 240 

gatctggctc atacccgaaa tgatgccaat cgattacagg atgccattgc taaggtagag 300 

gatgaatacc gagccttcca agaagaagct aagaaacaaa ttgaagattt gaatatgacg 360 

ttagaaaaat taagatcaga cctggatgaa aaagaaacag aaaggagtga catgaaagaa 420 

accatctttg aacttgaaga tgaagtagaa caacatcgtg ctgtgaaact tcatgacaac 480 

ctcattattt ctgatctaga gaatacagtt aaaaaactcc aggaccaaaa gcacgacatg 540 

gaaagagaaa taaagacact ccacagaaga cttcgggaag aatctgcgga atggcggcag 600 

tttcaggctg atctccagac tgcagtagtc attgcaaatg acattaaatc tgaagcccaa 660 

gaggagattg gtgatctaaa gcgccggtta catgaggctc aagaaaaaaa tgagaaactc 720 

acaaaagaat tggaggaaat aaagtcacgc aagcaagagg aggagcgagg cgggtataca 780 

attacatgaa tgccgttgag agagatttgg cagccttaag gcagggaatg ggactgagta 840 

gaaggtcctc gacttcctca gagccaactc ctacagtaaa aaccctc 887 

<210> 52 

<211> 491 

<212> DNA 

<213> Homo sapien 

<400> 52 

ggcacgagct tttccaaaaa tcatgctgct cctttctcta aagttcttac attttataga 60 

aaggaacctt tcactcttga ggcctactac agctctcctc aggatttgcc ctatccagat 120 

cctgctatag ctcagttttc agttcagaaa gtcactcctc agtctgatgg ctccagttca 180 

aaagtgaaag tcaaagttcg agtaaatgtc catggcattt tcagtgtgtc cagtgcatct 240 
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ttagtggagg ttcacaagtc tgaggaaaat gaggagccaa tggaaacaga tcagaatgca 300 

aaggaggaag agaagatgca agtggaccag gaggaaccac atgttgaaga gcaacagcag 360 

cagacaccag gcagaaaata aggcagagtc tgaagaaatg gagacctctc aagctggatc 420 

caaggataaa aagatggacc aaccacccca agccaagaag gcaaaagtga agaccagtac 480 

tgtggacctg g 491 

<210> 53 
<211> 787 
<212> DNA 
<213> Homo sapien 

<400> 53 

aagcagttga gtaggcagaa aaaagaacct cttcattaag gattaaaatg tataggccag 60 

cacgtgtaac ttcgacttca agatttctga atccatatgt agtatgtttc attgtcgtcg 120 

caggggtagt gatcctggca gtcaccatag ctctacttgt ttacttttta gcttttgatc 180 

aaaaatctta cttttatagg agcagttttc aactcctaaa tgttgaatat aatagtcagt 240 

taaattcacc agctacacag gaatacagga ctttgagtgg aagaattgaa tctctgatta 300 

ctaaaacatt caaagaatca aatttaagaa atcagttcat cagagctcat gttgccaaac 360 

tgaggcaaga tggtagtggt gtgagagcgg atgttgtcat gaaatttcaa ttcactagaa 420 

ataacaatgg agcatcaatg aaaagcagaa ttgagtctgt tttacgacaa atgctgaata 480 

actctggaaa cctggaaata aacccttcaa ctgagataac atcacttact gaccaggctg 540 

cagcaaattg gcttattaat gaatgtgggg ccggtccaga cctaataaca ttgtctgagc 600 

agagaatcct tggaggcact gaggctgagg agggaagctg gccgtggcaa gtcagtctgc 660 

ggctcaataa tgcccaccac tgtggaggca gcctgatcaa taacatgtgg atcctgacag 720 

cagctcactg cttcagaagc aactctaatc ctcgtgactg gattgccacg tctggtattt 780 

ccacaac 787 

<210> 54 

<211> 386 

<212> DNA 

<213> Homo sapien 

<400> 54 

ggcattttca gtgtgtccag tgcatcttta gtggaggttc acaagtctga ggaaaatgag 60 

gagccaatgg aaacagatca gaatgcaaag gaggaagaga agatgcaagt ggaccaggag 120 

gaaccacatg ttgaagagca acagcagcag acaccagcag aaaataaggc agagtctgaa 180 

gaaatggaga cctctcaagc tggatccaag gataaaaaga tggaccaacc accccaagcc 240 

aagaaggcaa aagtgaagac cagtactgtg gacctgccaa tcgagaatca gctattatgg 300 

cagatagaca gagagatgct caacttgtac attgaaaatg agggtaagat gatcatgcag 360 

gataaactgg agaaggagcg gaatga 386 

<210> 55 
<211> 1462 
<212> DNA 
<213> Homo sapien 

<400> 55 

aagcagttga gtaggcagaa aaaagaacct cttcattaag gattaaaatg tataggccag 60 

cacgtgtaac ttcgacttca agatttctga atccatatgt agtatgtttc attgtcgtcg 120 

caggggtagt gatcctggca gtcaccatag ctctacttgt ttacttttta gcttttgatc 180 

aaaaatctta cttttatagg agcagttttc aactcctaaa tgttgaatat aatagtcagt 240 

taaattcacc agctacacag gaatacagga ctttgagtgg aagaattgaa tctctgatta 300 

ctaaaacatt caaagaatca aatttaagaa atcagttcat cagagctcat gttgccaaac 360 

tgaggcaaga tggtagtggt gtgagagcgg atgttgtcat gaaatttcaa ttcactagaa 420 

ataacaatgg agcatcaatg aaaagcagaa ttgagtctgt tttacgacaa atgctgaata 480 

actctggaaa cctggaaata aacccttcaa ctgagataac atcacttact gaccaggctg 540 

cagcaaattg gcttattaat gaatgtgggg ccggtccaga cctaataaca ttgtctgagc 600 

agagaatcct tggaggcact gaggctgagg agggaagctg gccgtggcaa gtcagtctgc 660 
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ggctcaataa tgcccaccac tgtggaggca gcctgatcaa taacatgtgg atcctgacag 720 

cagctcactg cttcagaagc aactctaatc ctcgtgactg gattgccacg tctggtattt 780 

ccacaacatt tcctaaacta agaatgagag taagaaatat tttaattcat aacaattata 840 

aatctgcaac tcatgaaaat gacattgcac ttgtgagact tgagaacagt gtcaccttta 900 

ccaaagatat ccatagtgtg tgtctcccag ctgctaccca gaatattcca cctggctcta 960 

ctgcttatgt aacaggatgg ggcgctcaag aatatgctgg ccacacagtt ccagagctaa 1020 

ggcaaggaca ggtcagaata ataagtaatg atgtatgtaa tgcaccacat agttataatg 1080 

gagccatctt gtctggaatg ctgtgtgctg gagtacctca aggtggagtg gacgcatgtc 1140 

agggtgactc tggtggccca ctagtacaag aagactcacg gcggctttgg tttattgtgg 1200 

ggatagtaag ctggggagat cagtgtggcc tgccggataa gccaggagtg tatactcgag 1260 

tgacagcata cattgactgg attaggcaac aaactgggat ctagtgcaac aagtgcatcc 1320 

ctgttgcaaa gtctgtatgc aggtgtgcct gtcttaaatt ccaaagcttt acatttcaac 1380 

tgaaaaagaa actagaaatg tcctaattta acatcttgtt acataaatat ggtttaacaa 1440 

aaaaaaaaaa aaaaaactcg ag 1462 
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<210> 58 
<211> 259 
<212> PRT 
<213> Homo sapien 
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<210> 59 
<211> 125 
<212> PRT 
<213> Homo sapien 
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<210> 63 
<211> 776 
<212> DNA 
<213> Homo sapien 

<400> 63 

cacagatggt gatagaggaa tccatcttgc agtcagataa agccctcact gatagagaga 60 

aggcagtagc agtggatcgg gccaagaagg aggcagctga gaaggaacag gaacttttaa 120 

aacagaaatt acaggagcag ccagcaacag atggaggctc aagataagag tcgcaaggaa 180 

aactagccaa ctgaaggaga agctgcagat ggagagagaa cacctactga gagagcagat 240 

tatgatgttg gagcacacgc agaaggtcca aaatgattgg cttcatgaag gatttaagaa 300 

gaagtatgag gagatgaatg cagagataag tcaatttaaa cgtatgattg atactacaaa 360 

aaatgatgat actccctgga ttgcacgaac cttggacaac cttgccgatg agctaactgc 420 

aatattgtct gctcctgcta aattaattgg tcatggtgtc aaaggtgtga gctcactctt 480 

taaaaagcat aagctcccct tttaaggata ttatagattg tacatatatg ctttggacta 540 

tttttgatct gtatgttttt cattttcatt cagcaagttt tttttttttt tcagagtctt 600 

actctgttgc ccaggctgga gtacagtggt gcaatctcag ctcactgcaa cctctgcctc 660 

ctgggttcaa gagattcacc tgcctcagcc ccctagtagc tgggattata ggtgtacacc 720 

accacaccca gctaattttt gtatttttag tagagatggg gtttcactat gttggc 776 

<210> 64 
<211> 160 
<212> DNA 
<213> Homo sapien 

<400> 64 

gcagcgctct cggttgcagt acccactgga aggacttagg cgctcgcgtg gacaccgcaa 60 

gcccctcagt agcctcggcc caagaggcct gctttccact cgctagcccc gccgggggtc 120 

cgtgtcctgt ctcggtggcc ggacccgggc ccgagcccga 160 



<210> 65 
<211> 72 
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<212> PRT 

<213> Homo sapien 
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<210> 66 
<211> 2581 
<212> DNA 
<213> Homo sapien 



<400> 66 

ctttcaaccc gcgctcgccg gctccagccc cgcgcgcccc caccccttgc cctcccggcg 60 

gctccgcagg gtgaggtggc tttgaccccg ggttgcccgg ccagcacgac cgaggaggtg 120 

gctggacagc tggaggatga acggagaagc cgactgcccc acagacctgg aaatggccgc 180 

ccccaaaggc caagaccgtt ggtcccagga agacatgctg actttgctgg aatgcatgaa 240 

gaacaacctt ccatccaatg acagctccaa gttcaaaacc accgaatcac acatggactg 300 

ggaaaaagta gcatttaaag acttttctgg agacatgtgc aagctcaaat gggtggagat 360 

ttctaatgag gtgaggaagt tccgtacatt gacagaattg atcctcgatg ctcaggaaca 420 

tgttaaaaat ccttacaaag gcaaaaaact caagaaacac ccagacttcc caaagaagcc 480 

cctgacccct tatttccgct tcttcatgga gaagcgggcc aagtatgcga aactccaccc 540 

tgagatgagc aacctggacc taaccaagat tctgtccaag aaatacaagg agcttccgga 600 

gaagaagaag atgaaatata ttcaggactt ccagagagag aaacaggagt tcgagcgaaa 660 

cctggcccga ttcagggagg atcaccccga cctaatccag aatgccaaga aatcggacat 720 

cccagagaag cccaaaaccc cccagcagct gtggtacacc cacgagaaga aggtgtatct 780 

caaagtgcgg ccagatgcca ctacgaagga ggtgaaggac tccctgggga agcagtggtc 840 

tcagctctcg gacaaaaaga ggctgaaatg gattcataag gccctggagc agcggaagga 900 

gtacgaggag atcatgagag actatatcca gaagcaccca gagctgaaca tcagtgagga 960 

gggtatcacc aagtccaccc tcaccaaggc cgaacgccag ctcaaggaca agtttgacgg 1020 

gcgacccacc aagccacctc cgaacagcta ctcgctgtac tgcgcagagc tcatggccaa 1080 

catgaaggac gtgcccagca cagagcgcat ggtgctgtgc agccagcagt ggaagctgct 1140 

gtcccagaag gagaaggacg cctatcacaa gaagtgtgat cagaaaaaga aagattacga 1200 

ggtggagctg ctccgtttcc tcgagagcct gcctgaggag gagcagcagc gggtcttggg 1260 

ggaagagaag atgctgaaca tcaacaagaa gcaggccacc agccccgcct ccaagaagcc 1320 

agcccaggaa gggggcaagg gcggctccga gaagcccaag cggcccgtgt cggccatgtt 1380 

catcttctcg gaggagaaac ggcggcagct gcaggaggag cggcctgagc tctccgagag 1440 

cgagctgacc cgcctgctgg cccgaatgtg gaacgacctg tctgagaaga agaaggccaa 1500 

gtacaaggcc cgagaggcgg cgctcaaggc tcagtcggag aggaagcccg gcggggagcg 1560 

cgaggaacgg ggcaagctgc ccgagtcccc caaaagagct gaggagatct ggcaacagag 1620 

cgttatcggc gactacctgg cccgcttcaa gaatgaccgg gtgaaggcct tgaaagccat 1680 

ggaaatgacc tggaataaca tggaaaagaa ggagaaactg atgtggatta agaaggcagc 1740 

cgaagaccaa aagcgatatg agagagagct gagtgagatg cgggcacctc cagctgctac 1800 

aaattcttcc aagaagatga aattccaggg agaacccaag aagcctccca tgaacggtta 1860 

ccagaagttc tcccaggagc tgctgtccaa tggggagctg aaccacctgc cgctgaagga 1920 

gcgcatggtg gagatcggca gtcgctggca gcgcatctcc cagagccaga aggagcacta 1980 

caaaaagctg gccgaggagc agcaaaagca gtacaaggtg cacctggacc tctgggttaa 2040 

gagcctgtct ccccaggacc gtgcagcata taaagagtac atctccaata aacgtaagag 2100 

catgaccaag ctgcgaggcc caaaccccaa atccagccgg actactctgc agtccaagtc 2160 

ggagtccgag gaggatgatg aagaggatga ggatgacgag gacgaggatg aagaagagga 2220 
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agatgatgag aatggggact cctctgaaga tggcggcgac tcctctgagt ccagcagcga 2280 

ggacgagagc gaggatgggg atgagaatga agaggatgac gaggacgaag acgacgacga 2340 

ggatgacgat gaggatgaag ataatgagtc cgagggcagc agctccagct cctcctcctt 2400 

aggggactcc tcagactttg actccaactg aggcttagcc ccaccccagg ggagccaggg 2460 

agagcccagg agctcccctc cccaactgac cacctttgtt tcttccccat gttctgtccc 2520 

ttgcccccct ggcctccccc actttctttc tttctttaaa aaaaaaaaaa aaaaactcga 2580 

g " 2581 
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<210> 68 

<211> 434 
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<213> Homo sapien 
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<400> 68 



ctaagatgct ggatgctgaa gacatcgtcg gaactgcccg 
tgacttatgt gtctagcttc tatcatgcct tctctggagc 
ccaatcgcat ctgcaaagtg ttggcggtca atcaagagaa 
atgagaagct ggccagtgat ctgttggagt ggatccgccg 
atcgggtgcc tgagaacacc atgcatgcca tgcagcagaa 
atagacgcct gcacaagccg cccaaggtgc aggagaagtg 
acacgctgca gaccaaactg cggctcagca accggcctgc 
ggatggtctc ggat 



gccagatgag aaagccatta 
ccagaaggca gaaacagcag 
cgagcagctt atggaagact 
caccatccca tggctggaga 
gctggaggac ttccgagact 
ccagctggag atcaacttta 
cttcatgccc tccgagggca 



60 
120 
180 
240 
300 
360 
420 
434 



<210> 69 

<211> 244 

<212> DNA 

<213> Homo sapien 

<400> 69 

aggcagcatg ctcgttgaga gtcatcacca ctccctaatc tcaagtacgc agggacacaa 60 

acactgcgga aggccgcagg gtcctctgcc taggaaaacc agagaccttt gttcacttgt 120 

ttatgtgctg accttccctc cactattgtc ctgtgaccct gccaaatccc cctttgtgag 180 

aaacacccaa gaatgatcaa taaaaaataa attaatttag gaaaaaaaaa aaaaaaaact 240 

cgag 244 

<210> 70 

<211> 437 

<212> DNA 

<213> Homo sapien 



<210> 71 

<211> 271 

<212> DNA 

<213> Homo sapien 

<400> 71 

gcgcagagtt ctgtcgtcca ccatcgagtg aggaagagag cattggttcc cctgagatag 60 

aagagatggc tctcttcagt gcccagtctc catacattaa cccgatcatc ccctttactg 120 

gaccaatcca aggagggctg caggagggac ttcaggtgac cctccagggg actaccgaga 180 

gttttgcaca aaagtttgtg gtgaactttt cagaacagct tcaatggaga tgacttggcc 240 

ttccacttca accccggtta tgaggaagga g 271 

<210> 72 

<211> 290 

<212> DNA 

<213> Homo sapien 



<400> 70 



ctgggacggg agcgtccagc gggactcgaa ccccagatgt 

cttggtccct ggatccagcg tcggccagcc cagagcccgt 

ccaggcagtg ggaccccgcg agctgcacgt ccctgggcac 

tggggctgtg ccaggtgcgg ggtgggctgc cccctttctc 

cgtggccccc aggccggagt cttcctaagg ctgtgaggcc 

tctcgcagca gcagaccttg cccgtgatga gcggggaggc 

ctggttccct ggccatgggg gctgcacctc tgggggagcc 

tggcgcagga agccggg 



gaaggcgttt ctggaaagtc 
gccgcacatc cttgcgtcct 
ggacaagtgt gaggcactgt 
agaaccttcc agcctggtgc 
acccctgtcc tggcctccgt 
ccttggctgg ctgggccagg 
agccaaggag gaccccatgc 



60 
120 
180 
240 
300 
360 
420 
437 



<400> 72 



ccgagcccta cccggaggtc tccagaatcc ccaccgtcag 
ctggtgccct ctcctgctgc gaggactcgg cccagggctc 
cggtggccga gggtcccagc tcctgccttc ggcggaacgt 



gggatgcaac ggctccctgt 
gggcccgccc aaggccccta 
gatcagcgag agggagcgca 



60 
120 
180 
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ggaagcggat gtcgttgagc tgtgagcgtc tgcgggccct gctgccccag ttcgatggcc 240 
ggcgggagga catggcctcg gtcctggaga tgtctgttgc aattcctgcg 290 
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<210> 78 
<211> 2076 
<212> DNA 
<213> Homo sapien 

<400> 78 

agaaaaagtc tatgtttgca gaaatacaga tccaagacaa agacaggatg ggcactgctg 60 

gaaaagttat taaatgcaaa gcagctgtgc tttgggagca gaagcaaccc ttctccattg 120 

aggaaataga agttgcccca ccaaagacta aagaagttcg cattaagatt ttggccacag 180 

gaatctgtcg cacagatgac catgtgataa aaggaacaat ggtgtccaag tttccagtga 240 
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ttgtgggaca tgaggcaact gggattgtag agagcattgg agaaggagtg actacagtga 300 

aaccaggtga caaagtcatc cctctctttc tgccacaatg tagagaatgc aatgcttgtc 360 

gcaacccaga tggcaacctt tgcattagga gcgatattac tggtcgtgga gtactggctg 420 

atggcaccac cagatttaca tgcaagggca aaccagtcca ccacttcatg aacaccagta 480 

catttaccga gtacacagtg gtggatgaat cttctgttgc taagattgat gatgcagctc 540 

ctcctgagaa agtctgttta attggctgtg ggttttccac tggatatggc gctgctgtta 600 

aaactggcaa ggtcaaacct ggttccactt gcgtcgtctt tggcctgaga ggagttggcc 660 

tgtcagtcat catgggctgt aagtcagctg gtgcatctag gatcattggg attgacctca 720 

acaaagacaa atttgagaag gccatggctg taggtgccac tgagtgtatc agtcccaagg 780 

actctaccaa acccatcagt gaggtgctgt cagaaatgac aggcaacaac gtgggataca 840 

cctttgaagt tattgggcat cttgaaacca tgattgatgc cctggcatcc tgccacatga 900 

actatgggac cagcgtggtt gtaggagttc ctccatcagc caagatgctc acctatgacc 960 

cgatgttgct cttcactgga cgcacatgga agggatgtgt ctttggaggt ttgaaaagca 1020 

gagatgatgt cccaaaacta gtgactgagt tcctggcaaa gaaatttgac ctggaccagt 1080 

tgataactca tgtcttacca tttaaaaaaa tcagtgaagg atttgagctg ctcaattcag 1140 

gacaaagcat tcgaacggtc ctgacgtttt gagatccaaa gtggcaggag gtctgtgttg 1200 

tcatggtgaa ctggagtttc tcttgtgaga gttccctcat ctgaaatcat gtatctgtct 1260 

cacaaataca agcataagta gaagatttgt tgaagacata gaacccttat aaagaattat 1320 

taacctttat aaacatttaa agtcttgtga gcacctggga attagtataa taacaatgtt 1380 

aatatttttg atttacattt tgtaaggcta taattgtatc ttttaagaaa acatacactt 1440 

ggatttctat gttgaaatgg agatttttaa gagttttaac cagctgctgc agatatatat 1500 

ctcaaaacag atatagcgta taaagatata gtaaatgcat ctcctagagt aatattcact 1560 

taacacattg aaactattat tttttagatt tgaatataaa tgtatttttt aaacacttgt 1620 

tatgagttaa cttggattac attttgaaat cagttcattc catgatgcat attactggat 1680 

tagattaaga aagacagaaa agattaaggg acgggcacat ttttcaacga ttaagaatca 1740 

tcattacata acttggtgaa actgaaaaag tatatcatat gggtacacaa ggctatttgc 1800 

cagcatatat taatatttta gaaaatattc cttttgtaat actgaatata aacatagagc 1860 

tagaatcata ttatcatact tatcataatg ttcaatttga tacagtagaa ttgcaagtcc 1920 

ttaagtccct attcactgtg cttagtagtg actccattta ataaaaagtg tttttagttt 1980 

ttaacaacta cactgatgta tttatatata tttataacat gttaaaaatt tttaaggaaa 2040 

ttaaaaatta tataaaaaaa aaaaaaaaaa ctcgag 2076 

<210> 79 
<211> 2790 
<212> DNA 
<213> Homo sapien 

<400> 79 

aagcagttga gtaggcagaa aaaagaacct cttcattaag gattaaaatg tataggccag 60 

cacgtgtaac ttcgacttca agatttctga atccatatgt agtatgtttc attgtcgtcg 120 

caggggtagt gatcctggca gtcaccatag ctctacttgt ttacttttta gcttttgatc 180 

aaaaatctta cttttatagg agcagttttc aactcctaaa tgttgaatat aatagtcagt 240 

taaattcacc agctacacag gaatacagga ctttgagtgg aagaattgaa tctctgatta 300 

ctaaaacatt caaagaatca aatttaagaa atcagttcat cagagctcat gttgccaaac 360 

tgaggcaaga tggtagtggt gtgagagcgg atgttgtcat gaaatttcaa ttcactagaa 420 

ataacaatgg agcatcaatg aaaagcagaa ttgagtctgt tttacgacaa atgctgaata 480 

actctggaaa cctggaaata aacccttcaa ctgagataac atcacttact gaccaggctg 540 

cagcaaattg gcttattaat gaatgtgggg ccggtccaga cctaataaca ttgtctgagc 600 

agagaatcct tggaggcact gaggctgagg agggaagctg gccgtggcaa gtcagtctgc 660 

ggctcaataa tgcccaccac tgtggaggca gcctgatcaa taacatgtgg atcctgacag 720 

cagctcactg cttcagaagc aactctaatc ctcgtgactg gattgccacg tctggtattt 780 

ccacaacatt tcctaaacta agaatgagag taagaaatat tttaattcat aacaattata 840 

aatctgcaac tcatgaaaat gacattgcac ttgtgagact tgagaacagt gtcaccttta 900 

ccaaagatat ccatagtgtg tgtctcccag ctgctaccca gaatattcca cctggctcta 960 

ctgcttatgt aacaggatgg ggcgctcaag aatatgctgg ccacacagtt ccagagctaa 1020 

ggcaaggaca ggtcagaata ataagtaatg atgtatgtaa tgcaccacat agttataatg 1080 

gagccatctt gtctggaatg ctgtgtgctg gagtacctca aggtggagtg gacgcatgtc 1140 

agggtgactc tggtggccca ctagtacaag aagactcacg gcggctttgg tttattgtgg 1200 
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ggatagtaag ctggggagat cagtgtggcc tgccggataa gccaggagtg tatactcgag 1260 

tgacagccta ccttgactgg attaggcaac aaactgggat ctagtgcaac aagtgcatcc 1320 

ctgttgcaaa gtctgtatgc aggtgtgcct gtcttaaatt ccaaagcttt acatttcaac 1380 

tgaaaaagaa actagaaatg tcctaattta acatcttgtt acataaatat ggtttaacaa 1440 

acactgttta acctttcttt attattaaag gttttctatt ttctccagag aactatatga 1500 

atgttgcata gtactgtggc tgtgtaacag aagaaacaca ctaaactaat tacaaagtta 1560 

acaatttcat tacagttgtg ctaaatgccc gtagtgagaa gaacaggaac cttgagcatg 1620 

tatagtagag gaacctgcac aggtctgatg ggtcagaggg gtcttctctg ggtttcactg 1680 

aggatgagaa gtaagcaaac tgtggaaaca tgcaaaggaa aaagtgatag aataatattc 1740 

aagacaaaaa gaacagtatg aggcaagaga aatagtatgt atttaaaatt tttggttact 1800 

caatatctta tacttagtat gagtcctaaa attaaaaatg tgaaactgtt gtactatacg 1860 

tataacctaa ccttaattat tctgtaagaa catgcttcca taggaaatag tggataattt 1920 

tcagctattt aaggcaaaag ctaaaatagt tcactcctca actgagaccc aaagaattat 1980 

agatattttt catgatgacc catgaaaaat atcactcatc tacataaagg agagactata 2040 

tctattttat agagaagcta agaaatatac ctacacaaac ttgtcaggtg ctttacaact 2100 

acatagtact ttttaacaac aaaataataa ttttaagaat gaaaaattta atcatcggga 2160 

agaacgtccc actacagact tcctatcact ggcagttata tttttgagcg taaaagggtc 2220 

gtcaaacgct aaatctaagt aatgaattga aagtttaaag agggggaaga gttggtttgc 2280 

aaaggaaaag tttaaatagc ttaatatcaa tagaatgatc ctgaagacag aaaaaacttt 2340 

gtcactcttc ctctctcatt ttctttctct ctctctcccc ttctcataca catgcctccc 2400 

cgaccaaaga atataatgta aattaaatcc actaaaatgt aatggcatga aaatctctgt 2460 

agtctgaatc actaatattc ctgagttttt atgagctcct agtacagcta aagtttgcct 2520 

atgcatgatc atctatgcgt cagagcttcc tccttctaca agctaactcc ctgcatctgg 2580 

gcatcaggac tgctccatac atttgctgaa aacttcttgt atttcctgat gtaaaattgt 2640 

gcaaacacct acaataaagc catctacttt tagggaaagg gagttgaaaa tgcaaccaac 2700 

tcttggcgaa ctgtacaaac aaatctttgc tatactttat ttcaaataaa ttctttttga 2760 

aatgaaaaaa aaaaaaaaaa aaaactcgag 2790 

<210> 80. 
<211> 1460 
<212> DNA 
<213> Homo sapien 

<400> 80 

ctcaaagcag ttgagtaggc agaaaaaaga acctcttcat taaggattaa aatgtatagg 60 

ccagcacgtg taacttcgac ttcaagattt ctgaatccat atgtagtatg tttcattgtc 120 

gtcgcagggg tagtgatcct ggcagtcacc atagctctac ttgtttactt tttagctttt 180 

gatcaaaaat cttactttta taggagcagt tttcaactcc taaatgttga atataatagt 240 

cagttaaatt caccagctac acaggaatac aggactttga gtggaagaat tgaatctctg 300 

attactaaaa cattcaaaga atcaaattta agaaatcagt tcatcagagc tcatgttgcc 360 

aaactgaggc aagatggtag tggtgtgaga gcggatgttg tcatgaaatt tcaattcact 420 

agaaataaca atggagcatc aatgaaaagc agaattgagt ctgttttacg acaaatgctg 480 

aataactctg gaaacctgga aataaaccct tcaactgaga taacatcact tactgaccag 540 

gctgcagcaa attggcttat taatgaatgt ggggccggtc cagacctaat aacattgtct 600 

gagcagagaa tccttggagg cactgaggct gaggagggaa gctggccgtg gcaagtcagt 660 

ctgcggctca ataatgccca ccactgtgga ggcagcctga tcaataacat gtggatcctg 720 

acagcagctc actgcttcag aagcaactct aatcctcgtg actggattgc cacgtctggt 780 

atttccacaa catttcctaa actaagaatg agagtaagaa atattttaat tcataacaat 840 

tataaatctg caactcatga aaatgacatt gcacttgtga gacttgagaa cagtgtcacc 900 

tttaccaaag atatccatag tgtgtgtctc ccagctgcta cccagaatat tccacctggc 960 

tctactgctt atgtaacagg atggggcgct caagaatatg ctggccacac agttccagag 1020 

ctaaggcaag gacaggtcag aataataagt aatgatgtat gtaatgcacc acatagttat 1080 

aatggagcca tcttgtctgg aatgctgtgt gctggagtac ctcaaggtgg agtggacgca 1140 

tgtcagggtg actctggtgg cccactagta caagaagact cacggcggct ttggtttatt 1200 

gtggggatag taagctgggg agatcagtgt ggcctgccgg ataagccagg agtgtatact 1260 

cgagtgacag cctaccttga ctggattagg caacaaactg ggatctagtg caacaagtgc 1320 

atccctgttg caaagtctgt atgcaggtgt gcctgtctta aattccaaag ctttacattt 1380 

caactgaaaa agaaactaga aatgtcctaa tttaacatct tgttacataa atatggttta 1440 
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acaaaaaaaa aaaaaaaaaa 1460 

<210> 81 

<211> 386 

<212> PRT 

<213> Homo sapien 



<400> 81 



Met 




Ala 




He 


Gin He Gin 
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Gly 
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Val 


He 
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Cys Lys Ala 
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Ala Val 


Leu Trp Glu Gin Lys Gin 
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Pro 
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Ser 


He 


Glu 


Glu He Glu 


Val Ala 


Pro Pro Lys Thr Lys Glu 
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40 




45 


Val 


Arg 


He 


L 


He 


Leu Ala Thr 


Gly He 






50 
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60 


Val 


lie 


L 


Gl 

Y 


Thr 


Met Val Ser 
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Pro Val He Val Gly His 


65 










70 




75 80 


Glu 


Ala 


Thr 


Gl 


He 


Val Glu Ser 


He Gly 


uiu vjj.y vai inr inr vax 










85 
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Lv 
ys 


Pro 




Asp 


Lys 


Val He Pro 




Leu Pro Gin Cys Arg Glu 








100 






105 
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Cv 
ys 
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Ala 


Cys 


rg 


Asn Pro As 


Gly Asn 


Leu Cys He Arg Ser Asp 
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SI1 r ° 120 
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Gly 


rg 


Gl 


Val Leu Ala 


Asp Gly 


Tnr Tnr Arg Pne Tnr Cys 
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Gly 


ys 
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ro 


Val 


His His Phe 


Met Asn 


Tnr Ser Tnr Pne Tnr Glu 
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Thr Gly Lys 
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Val 
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u ys e er ^ro ys 
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Ser 
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He Ser Glu 
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Val 


Gly 
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Phe Glu Val 


He Gly 


His Leu Glu Thr Met He 
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290 








295 




300 


Gly 


Val 


Pro 


Pro 


Ser 


Ala Lys Met 


Leu Thr 


Tyr Asp Pro Met Leu Leu 


305 










310 




315 320 


Phe 


Thr 


Gly 


Arg 


Thr 


Trp Lys Gly 


Cys Val 
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Val 
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Gly lie Val Ser Trp Gly Asp Gin Cys Gly Leu Pro Asp Lys Pro Gly 
385 390 395 400 

Val Tyr Thr Arg Val Thr Ala Tyr Leu Asp Trp lie Arg Gin Gin Thr 
405 410 415 

Gly He 



<210> 84 

<211> 489 

<212> DNA 

<213> Homo sapien 

<400> 84 

aaaagggtaa gcttgatgat taccaggaac gaatgaacaa aggggaaagg cttaatcaag 60 

atcagctgga tgccgtttct aagtaccagg aagtcacaaa taatttggag tttgcaaaag 120 

aattacagag gagtttcatg gcactaagtc aagatattca gaaaacaata aagaagacag 180 

cacgtcggga gcagcttatg agagaagaag ctgaacagaa acgtttaaaa actgtacttg 240 

agctacagta tgttttggac aaattgggag atgatgaagt gcggactgac ctgaaacaag 300 

gtttgaatgg agtgccaata ttgtccgaag aggagttgtc attgttggat gaattctata 360 

agctagtaga ccctgaacgg gacatgagct tgaggttgaa tgaacagtat gaacatgcct 420 

ccattcacct gtgggacctg ctggaaggga aggaaaaacc tgtatgtgga accacctata 480 

aagttctaa 489 

<210> 85 

<211> 304 

<212> DNA 

<213> Homo sapien 

<400> 85 

gggacctgga ggaggccacg ctgcagcatg aagccacagc agccaccctg aggaagaagc 60 

acgcggacag cgtggccgag ctcggggagc agatcgacaa cctgcagcgg gtgaagcaga 120 

agctggagaa ggagaagagc gagatgaaga tggagatcga tgacctcgct tgtaacatgg 180 

aggtcatctc caaatctaag ggaaaccttg agaagatgtg ccgcacactg gaggaccaag 240 

tgagtgagct gaagacccag gaggaggaac agcagcggct gatcaatgaa ctgactgcgc 300 

agag 304 

<210> 86 

<211> 296 

<212> DNA 

<213> Homo sapien 

<400> 86 

gaaaatcctt cctttgaatg ggaatctcca agcagttgaa ttgggcgaaa aaagaacctc 60 

ttccttaagg attaaaatgt ttagggcaac acgtgttact tccacttcca gatttctgaa 120 

tccatatgtt gtatgtttcc ttgtcctccc aggggttgtg atcctggcag tccccatagc 180 

tctacttgtt tactttttag cttttgatca aaaatcttac ttttattgga gcaattttcc 240 

actcccaaat gttgaatata atagtccgtt taattccccc gcttcaccgg gaattc 296 

<210> 87 

<211> 904 

<212> DNA 

<213> Homo sapien 

<400> 87 

gtgtccagga aacgattcat gaacataaca agcttgctgc aaattcagat catctcatgc 60 

agattcaaaa atgtgagttg gtcttgatcc acacctaccc agttggtgaa gacagccttg 120 

tatctgatcg ttctaaaaaa gagttgtccc cggttttaac cagtgaagtt catagtgttc 180 

gtgcaggacg gcatcttgct accaaattga atattttagt acagcaacat tttgacttgg 240 
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cttcaactac tattacaaat attccaatga aggaagaaca gcatgctaac acatctgcca 300 

attatgatgt ggagctactt catcacaaag atgcacatgt agatttcctg aaaagtggtg 360 

attcgcatct aggtggcggc agtcgagaag gctcgtttaa agaaacaata acattaaagt 420 

ggtgtacacc aaggacaaat aacattgaat tacactattg tactggagct tatcggattt 480 

cacctgtaga tgtaaatagt agaccttcct cctgccttac taattttctt ctaaatggtc 540 

gttctgtttt attggaacaa ccacgaaagt caggttctaa agtcattagt catatgctta 600 

gtagccatgg aggagagatt tttttgcacg tccttagcag ttctcgatcc attctagaag 660 

atccaccttc aattagtgaa ggatgtggag gaagagttac agactaccgg attacagatt 720 

ttggtgaatt tatgagggga aaacagatta actccttttc tacaccccag atataaaatc 780 

gatggaagtc ttgaggtccc tttggaaccg agccaaaaga tcagttaaaa aaacataccc 840 

gttactggcc tatgatttca aaaacccacc atttttaaca tgcaagcggt agttccgtta 900 

acca 904 

<210> 88 

<211> 387 

<212> DNA 

<213> Homo sapien 

<400> 88 

cgtctctccc ccagtttgcc gttcacccgg agcgctcggg acttgccgat agtggtgacg 60 

gcggcaacat gtctgtggct ttcgcggccc cgaggcagcg aggcaagggg gagatcactc 120 

ccgctgcgat tcagaagatg ttggatgaca ataaccatct tattcagtgt ataatggact 180 

ctcagaataa aggaaagacc tcagagtgtt ctcagtatca gcagatgttg cacacaaact 240 

tggtatacct tgctacaata gcagattcta atcaaaatat gcagtctctt ttaccagcac 300 

cacccacaca gaatatgcct atgggtcctg gagggatgaa tcagagcggg cctcccccac 360 

ctccacgctc tcacaacatg ccttcaa 387 

<210> 89 

<211> 481 

<212> DNA 

<213> Homo sapien 

<400> 89 

tgttcttgga cctgcggtgc tatagagcag gctcttctag gttggcagtt gccatggaat 60 

ctggacccaa aatgttggcc cccgtttgcc tggtggaaaa taacaatgag cagctattgg 120 

tgaaccagca agctatacag attcttgaaa agatttctca gccagtggtg gtggtggcca 180 

ttgtaggact gtaccgtaca gggaaatcct acttgatgaa ccatctggca ggacagaatc 240 

atggcttccc tctgggctcc acggtgcagt ctgaaaccaa gggcatctgg atgtggtgcg 300 

tgccccaccc atccaagcca aaccacaccc tggtccttct ggacaccgaa ggtctgggcg 360 

atgtggaaaa gggtgaccct aagaatgact cctggatctt tgccctggct gtgctcctgt 420 

gcagcacctt tgtctacaac agcatgagca ccatcaacca ccaggccctg gagcagctgc 480 

a " 481 

<210> 90 

<211> 491 

<212> DNA 

<213> Homo sapien 

<400> 90 

tgaaaactgt tcttggacct gcggtgctat agagcaggtt ggcagttgcc atggaatctg 60 

gacccaaaat gttggccccc gtttgcctgg tggaaaataa caatgagcag ctattggtga 120 

accagcaagc tatacagatt cttgaaaaga tttctcagcc agtggtggtg gtggccattg 180 

taggactgta ccgtacaggg aaatcctact tgatgaacca tctggcagga cagaatcatg 240 

gcttccctct gggctccacg gtgcagtctg aaaccaaggg catctggatg tggtgcgtgc 300 

cccacccatc caagccaaac cacaccctgg tccttctgga caccgaaggt ctgggcgatg 360 

tggaaaaggg tgaccctaag aatgactcct ggatctttgc cctggctgtg ctcctgtgca 420 

gcacctttgt ctacaacagc atgagcacca tcaaccacca agccctggag cagctgcatt 480 

atgtgacgga c 491 
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<210> 91 

<211> 488 

<212> DNA 

<213> Homo sapien 

<400> 91 

ttcgacagtc agccgcatct tcttttgcgt cgccagccga gccacatcgc tcagacacca 60 

tggggaaggt gaaggtcgga gtcaacggat ttggtcgtat tgggcgcctg gtcaccaggg 120 

ctgcttttaa ctctggtaaa gtggatattg ttgccatcaa tgaccccttc attgacctca 180 

actacatggt ttacatgttc caatatgatt ccacccatgg caaattccat ggcaccgtcg 240 

aggctgagaa cgggaagctt gtcatcaatg gaaatcccat caccatcttc caggagcgag 300 

atccctccaa aatcaagtgg ggcgatgctg gcgctgagta cgtcgtggag tccactggcg 360 

tcttcaccac catggagaag gctggggctc atttgcaggg gggagccaaa agggtcatca 420 

tctctgcccc tctgctgatg ccccatgttc gtcatgggtg tgaaccatga gaagtatgac 480 

acagcctc 488 

<210> 92 

<211> 384 

<212> DNA 

<213> Homo sapien 

<400> 92 

gacagtcagc cgcatcttct tttgcgtcgc cagccgagcc acatcgctca gacaccatgg 60 

ggaaggtgaa ggtcggagtc aacggatttg gtcgtattgg gcgcctggtc accagggctg 120 

cttttaactc tggtaaagtg gatattgttg ccatcaatga ccccttcatt gacctcaact 180 

acatggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcgagg 240 

ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagcgagatc 300 

cctccaaaat caagtggggc gatactggcg ctgagtacgt cgtggagtcc actggcgtct 360 

tcaccaccat ggagaaggct gggg 384 
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<212> PRT 
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atggcggcgc ggtcgtcgtc gggggtggcg geggcagagg gggeggegge cctggcggca 60 

geggagaegg cagccgtgac ggtggcagcg gcggcgcggg acctgggcct gggggaatga 120 

ggcggccgcg gcgggccagc ggcggagccg tgtageggag aagctccccc tccctgcttc 180 

ccttggccga geegggggeg cgcgcgcacg cggccgtcca gagegggetc cccacccctc 240 

gactcctgcg acccgcaccg cacccccacc cgggcccgga ggatgatgaa gctcaagtcg 300 

aaccagaccc gcacctacga cggcgacggc tacaagaagc gggccgcatg cctgtgtttc 360 

egcagegaga gcgaggagga ggtgctactc gtgagcagta gtcgccatcc agacagatgg 420 

attgtccctg gaggaggcat ggageccgag gaggagecaa gtgtggcagc agttcgtgaa 480 

gtctgtgagg aggctggagt aaaagggaca ttgggaagat tagttggaat ttttgagaac 540 

caggagagga agcacaggac gtatgtctat gtgctcattg tcactgaagt gctggaagac 600 

tgggaagatt cagttaacat tggaaggaag agggaatggt ttaaaataga agacgecata 660 

aaagtgctgc agtatcacaa acccgtgcag gcatcatatt ttgaaacatt gaggcaaggc 720 

tactcagcca acaatggcac cccagtcgtg gccaccacat acteggttte tgetcagage 780 

tcgatgtcag gcatcagatg actgaagact tcctgtaaga gaaatggaaa ttggaaacta 840 

gactgaagtg caaatcttcc ctctcaccct ggctctttcc acttctcaca ggcctcctct 900 

ttcaaataag gcatggtggg cagcaaagaa agggtgtatt gataatgttg ctgtttggtg 960 

ttaagtgatg gggctttttc ttctgttttt attgagggtg ggggttgggt gtgtaatttg 1020 

taagtacttt tgtgcatgat ctgtccctcc ctcttcccac ccctgcagtc ctctgaagag 1080 
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aggccaacag ccttcccctg ccttggattc tgaagtgttc ctgtttgtct tatcctggcc 1140 
ctggccagac gttttctttg atttttaatt tttttttttt attaaaagat accagtatga 1200 
gaaaaaaaaa aaaaaaaaac tcgag 1225 

<210> 103 
<211> 741 
<212> DNA 
<213> Homo sapien 

<400> 103 
agaaacctca atcggattca 
atcaataact ggcagcaact 
atcctcgatg aagcacataa 
gctattcctg caagtaatcg 
gaactatggt ccctatttga 
tttaagatgg agtatgaaaa 
gaaaaagcct tgggatttaa 
ctcaggagga ctaaagaaga 
aatgaaaaga atccagatgt 
gatttaatta tttggatacg 
tctttagatc atatcaagga 
gtcttaaaga agctgtgtga 
aatcttggga cattctctgc 

<210> 104 
<211> 321 
<212> DNA 
<213> Homo sapien 

<400> 104 
ttgctctgcg tcatcaaaga caccaaactg 
cctcagatgg aactgccact ccaaggctgt 
aagaagaagc acgagctgaa gattactcag 
cagagcaagg aacaggccga gcagtggctg 
agtggccccg tggattcaga gtgtcctcct 
ctggagaaga aactgtcttc a 

<210> 105 
<211> 389 
<212> DNA 
.<213> Homo sapien 

<400> 105 

cagcactggc cacactataa aattcaggtt 

cgcttccagc atttattttc tttgcaccca 

cgaactctgt taaaggtaca gacagtacaa 

ggtgatagtc ttttgactta atatattatt 

aaggtcatta tttagaagat aatctgggtt 

gcacatgcta cttaatgtct ttaccaaata 

tgtataataa ggaaaagctg gcctataga 

<210> 106 
<211> 446 
<212> DNA 
<213> Homo sapien 

<400> 106 

gccacatttg ccctggtcat agtttaaaca ccaggtcctg tgtcacatct ttttggtgcc 60 



gcaaaggaat ggtgttatta tcactacata ccaaatgtta 60 

ttcaagcttt aggggccaag agtttgtgtg ggactatgtc 120 

aataaaaacc tcatctacta agtcagcaat atgtgctcgt 180 

cctcctcctc acaggaaccc caatccagaa taatttacaa 240 

ttttgcttgt caagggtccc tgctgggaac attaaaaact 300 

tcctattact agagcaagag agaaggatgc taccccagga 360 

aatatctgaa aacttaatgg caatcataaa accctatttt 420 

cgtacagaag aaaaagtcaa gcaacccaga ggccagactt 480 

tgatgccatt tgtgaaatgc cttccctttc caggagaaat 540 

acttgtgcct ttacaagaag aaatatacag gaaatttgtg 600 

gttgctaatg gagacgcgct cacctttggc tgagctaggt 660 

tcatcctagg ctgctgtctg cacgggcttg ttgtttgcta 720 

t 741 



ctgtgctata aaagttccaa 
aacattacgt acatcccgaa 
cagggcacgg acccgcttgt 
aaggtgatca aagaagccta 
ccaccaagct ccccggtgca 



ggaccagcag 60 
agacagcaaa 120 
tctcgccgtc 180 
cagtggttgt 240 
caaggcagaa 300 
321 



cagaaaaaca gggtaagtca 
tgggcaattt gagaaaattt 
tactttttat tcagaaggtt 
gtctcctgcc ttgtgtttct 
gtatttgtgt cgtcagattg 
ataacaaagg gaaagaaaac 



cagacagcaa 60 

acctttagaa 120 

tctgcataaa 180 

ggaatgaatg 240 

aattttcatt 300 

caaatataga 360 
389 



WO 02/092001 



49 



PCT7US02/14975 



acaagtatca ctccattgtt cagagagtaa tgtattagtt ctgcccaatt cattcttcac 120 

ttttatttct tccatttcat tagcatttat atcagctcaa gaagttaagg ttagaaaatt 180 

ttccacttca aattttcagt acagaaatgt gctgtgatgt ttgacaagac tatttcatag 240 

taagtgagtt aatgtttatt ggcctctgct ctcctctgtg tcagacctag gaagcctgag 300 

gattacttag ttgttctgtc tctgggtcca caggcagaat ttggcccatc caaagactgg 360 

ccaagtgcca aaaaaaggcc tgattaggcc ctgaaattca gtgaaattct gcctgaagaa 420 

acctcttatt gaatttgaaa accata 446 

<210> 107 
<211> 467 
<212> DNA 
<213> Homo sapien 

<400> 107 

ccgccgctgc cgtcgccttc ctgggattgg agtctcgagc tttcttcgtt cgttcgccgg 60 

cgggttcgcg cccttctcgc gcctcggggc tgcgaggctg gggaaggggt tggagggggc 120 

tgttgatcgc cgcgtttaag ttgcgctcgg ggcggccatg tcggccggcg aggtcgagcg 180 

cctagtgtcg gagctgagcg gcgggaccgg aggggatgag gaggaagagt ggctctatgg 240 

cgatgaagat gaagttgaaa ggccagaaga agaaaatgcc agtgctaatc ctccatctgg 300 

aattgaagat gaaactgctg aaaatggtgt accaaaaccg aaagtgactg agaccgaaga 360 

tgatagtgat agtgacagcg atgatgatga agatgatgtg catgtcacta taggagacat 420 

taaaacggga gcaccacagt atgggagtta tggtacagca cctgtaa 467 

<210> 108 

<211> 491 

<212> DNA 

<213> Homo sapien 

<400> 108 

gaaagataca acttccccaa cccaaacccg tttgtggagg acgacatgga taagaatgaa 60 

atcgcctctg ttgcgtaccg ttaccgcagg tggaagcttg gagatgatat tgaccttatt 120 

gtccgttgtg agcacgatgg cgtcatgact ggagccaacg gggaagtgtc cttcatcaac 180 

atcaagacac tcaatgagtg ggattccagg cactgtaatg gcgttgactg gcgtcagaag 240 

ctggactctc agcgaggggc tgtcattgcc acggagctga agaacaacag ctacaagttg 300 

gcccggtgga cctgctgtgc tttgctggct ggatctgagt acctcaagct tggttatgtg 360 

tctcggtacc acgtgaaaga ctcctcacgc cacgtcatcc taggcaccca gcagttcaag 420 

cctaatgagt ttgccagcca gatcaacctg agcgtggaga atgcctgagg cattttacgc 480 

tgcgtcattg a 491 

<210> 109 
<211> 489 
<212> DNA 
<213> Homo sapien 

<400> 109 

ctcagatagt actgaaccct ttatcaacta tgttttttca gtctgacaac caaggcggct 60 

actaagtgac taaggggcag gtagtataca gtgtggataa gcaggacaaa ggggtgattc 120 

acatcccagg caggacagag caggagatca tgagatttca tcactcagga tggcttgtga 180 

tttattttat tttattcttt tttttttttg agatggagtc tcactcttgc ccaggctgga 240 

gtgcagtggt gcgatcttgg ctcactgcaa cctctgcctc ctgggttcaa gcagttctcc 300 

tgcctcagcc tcccaagtag ctgggattac aggcgtccgc caccatgccc agccaatttt 360 

tgtactttta gtagagatgg ggtttcacca tgttggccag gctggtctcg aactcctgac 420 

ctcaggtgat ccactcgcct cggcctccca aagtgctggg attataggca tgcgccacca 480 

tgcccgggc 489 

<210> 110 
<211> 391 
<212> DNA 
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<213> Homo sapien 
<400> 110 

gcggagtccg ctggctgacc cgagcgctgg tctccgccgg gaaccctggg gcatggagag 60 

gtctgagtac ctcggccgcg gcgcacgctg catcgcggag ccaggctgcc gctgtcccag 120 

tggagttcca ggagcaccac ctgagtgagg tgcagaatat ggcatctgag gagaagctgg 180 

agcaggtgct gagttccatg aaggagaaca aagtggccat cattggaaag attcataccc 240 

cgatggagta taagggggag ctagcctcct atgatatgcg gctgaggcgt aagttggact 300 

tatttgccaa cgtaatccat gtgaagtcac ttcctgggta tatgactcgg cacaacaatc 360 

tagacctggt gatcattcga gagcagacag a 391 

<210> 111 
<211> 172 
<212> PRT 
<213> Homo sapien 



<400> 111 
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<210> 113 
<211> 107 
<212> PRT 
<213> Homo sapien 
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<210> 114 
<211> 155 
<212> PRT 
<213> Homo sapien 



<400> 114 
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<210> 115 
<211> 129 
<212> PRT 
<213> Homo sapien 

<400> 115 
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<210> 116 
<211> 550 
<212> DNA 
<213> Homo sapien 

<400> 116 

gaattcggca ccagcctcag agccccccag cccggctacc accccctgcg gaaaggtacc 60 

catctgcatt cctgcccgtc gggacctggt ggacagtcca gcctccttgg cctctagcct 120 

tggctcaccg ctgcctagag ccaaggagct catcctgaat gaccttcccg ccagcactcc 180 

tgcctccaaa tcctgtgact cctccccgcc ccaggacgct tccaccccca ggcccagctc 240 

ggccagtcac ctctgccagc ttgctgccaa gccagcacct tccacggaca gcgtcgccct 300 

gaggagcccc ctgactctgt ccagtccctt caccacgtcc ttcagcctgg gctcccacag 360 

cactctcaac ggagacctct ccgtgcccag ctcctacgtc agcctccacc tgtcccccca 420 

ggtcagcagc tctgtggtgt acggacgctc ccccgtgatg gcatttgagt ctcatcccca 480 

tctccgaggg tcatccgtct cttcctccct acccagcatc cctgggggaa agccggccta 540 

ctccttccac 550 

<210> 117 
<211> 154 
<212> DNA 
<213> Homo sapien 
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<400> 117 

ttctgaggga aagccgagtg gagtgggcga cccggcggcg gtgacaatga gttttcttgg 60 

aggctttttt ggtcccattt gtgagattga tgttgccctt aatgatgggg aaaccaggaa 120 

aatggcagaa atgaaaactg aggatggcaa agta 154 

<210> 118 
<211> 449 
<212> DNA 
<213> Homo sapien 

<400> 118 

gaattcggca ccagggcccg cagcccgagt gtcgccgcca tggcttcgcc gcagctctgc 60 

cgcgcgctgg tgtcggcgca atgggtggcg gaggcgctgc gggccccgcg cgctgggcag 120 

cctctgcagc tgctggacgc ctcctggtac ctgccgaagc tggggcgcga cgcgcgacgc 180 

gagttcgagg agcgccacat cccgggcgcc gctttcttcg acatcgacca gtgcagcgac 240 

cgcacctcgc cctacgacca catgctgccc ggggccgagc atttcgcgga gtacgcaggc 300 

cgcctgggcg tgggcgcggc cacccacgtc gtgatctacg acgccagcga ccagggcctc 360 

tactccgccc cgcgcgtctg gtggatgttc cgcgccttcg gccaccacgc cgtgtcactg 420 

cttgatggcg gcctccgcca ctggctgcg 449 

<210> 119 
<211> 642 
<212> DNA 
<213> Homo sapien 

<400> 119 

gaattcggca cgagcagtaa cccgaccgcc gctggtcttc gctggacacc atgaatcaca 60 

ctgtccaaac cttcttctct cctgtcaaca gtggccagcc ccccaactat gagatgctca 120 

aggaggagca cgaggtggct gtgctggggg cgccccacaa ccctgctccc ccgacgtcca 180 

ccgtgatcca catccgcagc gagacctccg tgcccgacca tgtcgtctgg tccctgttca 240 

acaccctctt catgaacccc tgctgcctgg gcttcatagc attcgcctac tccgtgaagt 300 

ctagggacag gaagatggtt ggcgacgtga ccggggccca ggcctatgcc tccaccgcca 360 

agtgcctgaa catctgggcc ctgattctgg gcatcctcat gaccattctg ctcatcgtca 420 

tcccagtgct gatcttccag gcctatggat agatcaggag gcatcactga ggccaggagc 480 

tctgcccatg acctgtatcc cacgtactcc aacttccatt cctcgccctg cccccggagc 540 

cgagtcctgt atcagccctt tatcctcaca cgcttttcta caatggcatt caataaagtg 600 

cacgtgtttc tggtgaaaaa aaaaaaaaaa aaaaaactcg ag 642 

<210> 120 
<211> 603 
.<212> DNA 
<213> Homo sapien 

<400> 120 

gaattcggca cgagccacaa cagccactac gactgcatcc actggatcca cggccacccc 60 

gtcctccacc ccgggaacag ctccccctcc caaagtgctg accagcccgg ccaccacacc 120 

catgtccacc atgtccacaa tccacacctc ctctactcca gagaccaccc acacctccac 180 

agtgctgacc accacagcca ccatgacaag ggccaccaat tccacggcca caccctcctc 240 

cactctgggg acgacccgga tcctcactga gctgaccaca acagccacta caactgcagc 300 

cactggatcc acggccaccc tgtcctccac cccagggacc acctggatcc tcacagagcc 360 

gagcactata gccaccgtga tggtgcccac cggttccacg gccaccgcct cctccactct 420 

gggaacagct cacaccccca aagtggtgac caccatggcc actatgccca cagccactgc 480 

ctccacggtt cccagctcgt ccaccgtggg gaccacccgc acccctgcag tgctccccag 540 

cagcctgcca accttcagcg tgtccactgt gtcctcctca gtcctcacca ccctgagacc 600 

cac 603 



<210> 121 
<211> 178 
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20 






SP SU 25 




er 


Ser 


Leu 


Gly 


Ser 


Pro 


Leu Pro Arg Ala 


Lys Glu Leu He Leu Asn 






35 












sp 


k® u 


Pro 


Ala 


Ser 


Thr 


Pro Ala Ser Lys 


Ser Cys Asp Ser Ser Pro 
















n • t r. 




G n 


Asp 


Ala 


Ser 


Thr 


Pro Arg Pro Ser 


Ser Ala Ser His Leu Cys 


^° 










70 






n 


Leu 


Ala 


Ala 


Lys 


Pro 


Ala Pro Ser Thr 


Asp Ser Val Ala Leu Arg 










85 








Ser 
er 


Pro 


Leu 


Thr 


Leu 


Ser 


Ser Pro Ph Thr 


Thr Ser Phe Ser Leu Gl 
r er e u y 








100 






Sr r ° 105 r 




Ser 
er 


His 


Ser 


Thr 


Leu 


Asn 


(jly Asp Leu Ser 


Val Pro Ser Ser Tyr Val 






115 








120 


125 


Ser 


Leu 


His 


Leu 


Ser 


Pro 


Gin Val Ser Ser 


Ser Val Val Tyr Gly Arg 




130 










135 


140 


Ser 


Pro 


Val 


Met 


Ala 


Phe 


Glu Ser His Pro 


His Leu Arg Gly Ser Ser 


145 










150 




155 160 


Val 


Ser 


Ser 


Ser 


Leu 


Pro 


Ser He Pro Gly 


Gly Lys Pro Ala Tyr Ser 










165 




170 


175 


Phe 


His 















<210> 122 

<211> 36 

<212> PRT 

<213> Homo sapien 





<400> 


122 










Met 


Ser Phe 


Leu 


Gly 


Gly 


Phe Phe Gly Pro 


He Cys Glu He Asp Val 


1 






5 




10 


15 


Ala 


Leu Asn 


Asp 


Gly 


Glu 


Thr Arg Lys Met 


Ala Glu Met Lys Thr Glu 






20 






25 


30 


Asp 


Gly Lys 


Val 












35 














<210> 


123 












<211> 


136 












<212> 


PRT 












<213> 


Home 


j sapien 








<400> 


123 










Met 


Ala Ser 


Pro 


Gin 


Leu 


Cys Arg Ala Leu 


Val Ser Ala Gin Trp Val 


1 






5 




10 


15 


Ala 


Glu Ala 


Leu 


Arg 


Ala 


Pro Arg Ala Gly 


Gin Pro Leu Gin Leu Leu 






20 






25 


30 


Asp 


Ala Ser 


Trp 


Tyr 


Leu 


Pro Lys Leu Gly 


Arg Asp Ala Arg Arg Glu 




35 








40 


45 


Phe 


Glu Glu 


Arg 


His 


He 


Pro Gly Ala Ala 


Phe Phe Asp lie Asp Gin 




50 








55 


60 


Cys 


Ser Asp 


Arg 


Thr 


Ser 


Pro Tyr Asp His 


Met Leu Pro Gly Ala Glu 


65 








70 




75 80 
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His 


Phe Ala 


Glu 


Tyr 


Ala 


Gly 


Arg 


Leu 


Gly Val 


Gly Ala Ala Thr His 








85 










90 


95 


Val 


Val He 


Tyr 


Asp 


Ala 


Ser 


Asp 


Gin 


Gly Leu 


Tyr Ser Ala Pro Arg 






100 










105 




110 


Val 


Trp Trp 


Met 


Phe 


Arg 


Ala 


Phe 


Gly 


His His 


Ala Val Ser Leu Leu 




115 










120 






125 


Asp 


Gly Gly 


Leu 


Arg 


His 


Trp 


Leu 









130 135 



<210> 124 
<211> 133 
<212> PRT 
<213> Homo sapien 

<400> 124 



Met 


Asn 


His 


Thr 


Val 


Gin 


Thr 


Phe 


Phe 


Ser 


Pro 


Val Asn Ser Gly Gin 


1 








5 










10 




15 


Pro 


Pro 


Asn 


Tyr 
20 


Glu 


Met 


Leu 


Lys 


Glu 
25 


Glu 


His 


Glu Val Ala Val Leu 
30 


Gly 


Ala 


Pro 
35 


His 


Asn 


Pro 


Ala 


Pro 
40 


Pro 


Thr 


Ser 


Thr Val He His He 
45 


Arg 


Ser 
50 


Glu 


Thr 


Ser 


Val 


Pro 
55 


Asp 


His 


Val 


Val 


Trp Ser Leu Phe Asn 
60 


Thr 


Leu 


Phe 


Met 


Asn 


Pro 


Cys 


Cys 


Leu 


Gly Phe 


He Ala Phe Ala Tyr 


65 










70 










75 


80 


Ser 


Val 


Lys 


Ser 


Arg 
85 


Asp 


Arg 


Lys 


Met 


Val 
90 


Gly 


Asp Val Thr Gly Ala 
95 


Gin 


Ala 


Tyr 


Ala 
100 


Ser 


Thr 


Ala 


Lys 


Cys 
105 


Leu 


Asn 


He Trp Ala Leu He 
110 


Leu 


Gly 


He 
115 


Leu 


Met 


Thr 


He 


Leu 
120 


Leu 


He 


Val 


He Pro Val Leu He 
125 


Phe 


Gin 
130 


Ala 


Tyr 


Gly 

















<210> 125 
<211> 195 
<212> PRT 
<213> Homo sapien 

<400> 125 



Thr 


Thr 


Ala 


Thr 


Thr 


Thr 


Ala 


Ser 


Thr 


Gly Ser 


Thr Ala Thr Pro Ser 


1 








5 










10 


15 


Ser 


Thr 


Pro 


Gly 


Thr 


Ala 


Pro 


Pro 


Pro 


Lys Val 


Leu Thr Ser Pro Ala 








20 










25 




30 


Thr 


Thr 


Pro 


Met 


Ser 


Thr 


Met 


Ser 


Thr 


He His 


Thr Ser Ser Thr Pro 
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40 






45 


Glu 


Thr 


Thr 


His 


Thr 


Ser 


Thr 


Val 


Leu 


Thr Thr 


Thr Ala Thr Met Thr 




50 
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60 


Arg 


Ala 


Thr 


Asn 


Ser 


Thr 


Ala 


Thr 


Pro 


Ser Ser 


Thr Leu Gly Thr Thr 
80 


65 










70 








75 


Arg 


He 


Leu 


Thr 


Glu 


Leu 


Thr 


Thr 


Thr 


Ala Thr 


Thr Thr Ala Ala Thr 










85 










90 


95 


Gly 


Ser 


Thr 


Ala 


Thr 


Leu 


Ser 


Ser 


Thr 


Pro Gly 


Thr Thr Trp He Leu 








100 










105 




110 


Thr 


Glu 


Pro 


Ser 


Thr 


He 


Ala 


Thr 


Val 


Met Val 


Pro Thr Gly Ser Thr 






115 










120 






125 


Ala 


Thr 


Ala 


Ser 


Ser 


Thr 


Leu 


Gly 


Thr 


Ala His 


Thr Pro Lys Val Val 




130 










135 








140 
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Thr Thr Met 


Ala 


Thr Met Pro Thr Ala Thr Ala 


Ser 


Thr 


Val 


Pro Ser 


145 




150 155 








160 


Ser Ser Thr 


Val 
a 


Gly Thr Thr Arg Thr Pro Ala 


V 1 


Leu 


Pro 


SSJT S@r 
















Leu Pro Thr 


Phe 


Ser Val Ser Thr Val Ser Ser 


Ser 


Val 


Leu 


Thr Thr 




180 


185 






190 




Leu Arg Pro 














195 














<210> 


126 












<211> 


509 












<212> 


DNA 












<213> 


Home 


> sapien 










<400> 


126 













gaatteggea egagecaagt accccctgag gaatctgcag cctgcatctg agtacaccgt 60 

atccctcgtg gecataaagg gcaaccaaga gagccccaaa gecactggag tctttaccac 120 

actgcagcct gggagctcta ttccacctta caacaccgag gtgactgaga ccaccattgt 180 

gatcacatgg acgcctgctc caagaattgg ttttaagctg ggtgtacgac caagecaggg 240 

aggagaggca ccacgagaag tgacttcaga ctcaggaagc atcgttgtgt ccggcttgac 300 

tccaggagta gaataegtet acaccatcca agtcctgaga gatggacagg aaagagatgc 360 

gccaattgta aacaaagtgg tgacaccatt gtctccacca acaaacttgc atctggaggc 420 

aaaccctgac actggagtgc tcacagtctc ctggagagga gcaccacccc agacattact 480 

gggtatagaa ttaccacaac ccctacaaa 509 

<210> 127 
<211> 500 
<212> DNA 
<213> Homo sapien 

<400> 127 

gaatteggea cgagccactg atgtccgggg agtcagccag gagcttgggg aagggaagcg 60 

cgcccccggg gccggtcccg gagggctcga tccgcatcta cagcatgagg ttctgcccgt 120 

ttgetgagag gaegegtcta gtcctgaagg ccaagggaat caggcatgaa gtcatcaata 180 

tcaacctgaa aaataagect gagtggttct ttaagaaaaa tccctttggt ctggtgccag 240 

ttctggaaaa cagtcagggt cagctgatct aegagtctge catcacctgt gagtacctgg 300 

atgaagcata cccagggaag aagctgttgc cggatgaccc ctatgagaaa gettgecaga 360 

agatgatctt agagttgttt tctaaggtgc catccttggt aggaagcttt attagaagee 420 

aaaataaaga agactatget ggcctaaaag aagaatttcg taaagaattt accaagctag 480 

aggaggttct gactaataag 500 

<210> 128 
<211> 500 
<212> DNA 
<213> Homo sapien 

<400> 128 

agctttcctc tgctgccgct cggtcacgct tgtgcccgaa ggaggaaaca gtgacagacc 60 

tggagactgc agttctctat ccttcacaca gctctttcac catgcctgga tcacttcctt 120 

tgaatgeaga agettgetgg ccaaaagatg tgggaattgt tgeccttgag atctattttc 180 

cttctcaata tgttgatcaa gcagagttgg aaaaatatga tggtgtagat gctggaaagt 240 

ataccattgg cttgggccag gecaagatgg gcttctgcac agatagagaa gatattaact 300 

etctttgeat gactgtggtt cagaatctta tggagagaaa taacctttcc tatgattgea 360 

ttgggcggct ggaagttgga acagagacaa tcatcgacaa atcaaagtct gtgaagacta 420 

atttgatgea gctgtttgaa gagtctggga atacagatat agaaggaatc gacacaacta 480 

atgeatgeta tggaggcaca 500 



<210> 129 
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<211> 497 
<212> DNA 
<213> Homo sapien 



<400> 129 
gaattcggca cgagcagagg tctccagagc 
taaggaaaaa ctcattgcac cagttgcgga 
cactgtagtg ggtgttggac aagttggtat 
tctggctgat gaacttgctc ttgtggatgt 
ggatctgcag catgggagct tatttcttca 
ttctgtgacc gccaattcta agattgtagt 
ggagagtcgg ctcaatctgg tgcagagaaa 
gatcgtcaag tacagtcctg attgcatcat 
tacgtatgtt acctgga 

<210> 130 
<211> 383 
<212> DNA 
<213> Homo sapien 

<400> 130 

gaattcggca cgagggccgc ggctgccgac tgggtcccct gccgctgtcg ccaccatggc 60 

tccgcaccgc cccgcgcccg cgctgctttg cgcgctgtcc ctggcgctgt gcgcgctgtc 120 

gctgcccgtc cgcgcggcca ctgcgtcgcg gggggcgtcc caggcggggg cgccccaggg 180 

gcgggtgccc gaggcgcggc ccaacagcat ggtggtggaa caccccgagt tcctcaaggc 240 

agggaaggag cctggcctgc agatctggcg tgtggagaaa gttcgatctg gtggcccgtg 300 

cccaccaacc tttatggaga cttcttcacg ggcgacgcct acgtcatcct gaagacagtg 360 

cagcttaaga acggaaaatc ttg 383 

<210> 131 
<211> 509 
<212> DNA 
<213> Homo sapien 

<400> 131 

gaattcggca cgagagtcag ccgcatcttc ttttgcgtcg 
agacaccatg gggaaggtga aggtcggagt caacggattt 
caccagggct gcttttaact ctggtaaagt ggatattgtt 
tgacctcaac tacatggttt acatgttcca atatgattcc 
caccgtcaag gctgagaacg ggaagcttgt catcaatgga 
ggagcgagat ccctccaaaa tcaagtgggg cgatgctggc 
cactggccgt cttcaccacc atggagaagg ctggggctca 
gggtcatcat ctctgccccc tctgctgacg cccccatgtt 
agaagtatga caacagcctc aagatcatc 

<210> 132 
<211> 357 
<212> DNA 
<213> Homo sapien 

<400> 132 

gaattcggca cgagtaagaa gaagccccta gaccacagct 
ggaggatcct cttcttggtg gcagcagcaa caggtgccca 
aatctgggtc tgagttgaag aagcctgggg cctcagtgaa 
gacacatctt cagtatctat ggtttgaatt gggtgcgaca 
agtggatggg atggatcaaa gtcgacactg cgaacccaac 
gacgatttgt cttctccctg gacacctctg tcagcacggc 



cttctctctc ctgtgcaaaa 
agaagaggca acagttccaa 
ggcgtgtgct atcagcattc 
tttggaagat aagcttaaag 
gacacctaaa attgtggcag 
ggtaactgca ggagtccgtc 
tgttaatgtc ttcaaattca 
aattgtggtt tccaacccag 



tggcaactct 60 

acaataagat 120 

tgggaaagtc 180 

gagaaatgat 240 

ataaagatta 300 

agcaagaagg 360 

ttattcctca 420 

tggacattct 480 
497 



ccagccgagc cacatcgctc 60 

ggtcgtattg ggcgcctggt 120 

gccatcaatg accccttcat 180 

acccatggca aattccatgg 240 

aatcccatca ccatcttcca 300 

gctgagtacg tcgtggagtc 360 

tttgcagggg ggagccaaaa 420 

cgtcatgggt gtgaaccatg 480 
509 



ccacaccatg gactggacct 60 

ctcccaggtg caactggtgc 120 

ggtttcctgc aaggcttctg 180 

ggcccctggt caaggccttg 240 

gtatgcccag ggcttcacag 300 

atatctgcag atcagca 357 
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<210> 133 
<211> 468 
<212> DNA 
<213> Homo sapien 



<400> 133 

gaattcggca cgaggcgccc cgaaccgtcc tcctgctgct ctcggcggcc ctggccctga 60 

ccgagacctg ggccggctcc cactccatga ggtatttcga caccgccatg tcccggcccg 120 

gccgcgggga gccccgcttc atctcagtgg gctacgtgga cgacacgcag ttcgtgaggt 180 

tcgacagcga cgccgcgagt ccgagagagg agccgcgggc gccgtggata gagcaggagg 240 

ggccggagta ttgggaccgg aacacacaga tcttcaagac caacacacag actgaccgag 300 

agagcctgcg gaacctgcgc ggctactaca accagagcga ggccgggtct cacaccctcc 360 

agagcatgta cggctgcgac gtggggccgg acgggcgcct cctccgcggg cataaccagt 420 

acgcctacga cggcaaggat tacatcgccc tgaacgagga cctgcgct 468 

<210> 134 

<211> 214 

<212> DNA 

<213> Homo sapien 



<400> 134 

gaattcggca cgagctgcgt cctgctgagc tctgttctct ccagcacctc ccaacccact 60 

agtgcctggt tctcttgctc caccaggaac aagccaccat gtctcgccag tcaagtgtgt 120 

ccttccggag cgggggcagt cgtagcttca gcaccgcctc tgccatcacc ccgtctgtct 180 

cccgcaccag cttcacctcc gtgtcccggt ccgg 214 

<210> 135 

<211> 355 

<212> DNA 

<213> Homo sapien 



<400> 135 

gaattcggca cgaggtgaac aggacccgtc gccatgggcc gtgtgatccg tggacagagg 60 

aagggcgccg ggtctgtgtt ccgcgcgcac gtgaagcacc gtaaaggcgc tgcgcgcctg 120 

cgcgccgtgg atttcgctga gcggcacggc tacatcaagg gcatcgtcaa ggacatcatc 180 

cacgacccgg gccgcggcgc gcccctcgcc aaggtggtct tccgggatcc gtatcggttt 240 

aagaagcgga cggagctgtt cattgccgcc gagggcattc acacgggcca gtttgtgtat 300 

tgcggcaaga aggcccagct caacattggc aatgtgctcc ctgtgggcac catgc 355 

<210> 136 
<211> 242 
<212> DNA 
<213> Homo sapien 



<400> 136 

gaattcggca cgagccagct cctaaccgcg agtgatccgc cagcctccgc ctcccgaggt 60 

gcccggattg cagacggagt ctccttcact cagtgctcaa tggtgcccag gctggagtgc 120 

agtggtgtga tctcggctcg ctacaacatc cacctcccag cagcctgcct tggcctccca 180 

aagtgccgag attgcagctc tctgcccggc cgccacccct gtctgggaag tgaggatgct 240 

gt 242 



<210> 137 
<211> 424 
<212> DNA 
<213> Homo sapien 



<400> 137 

gaattcggca cgagcccaga tcccgaggtc cgacagcgcc cggcccagat ccccacgcct 



60 
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gccaggagca agccgagagc cagccggccg gcgcactccg 
ttcgacccga gccccgcgcc ctttccggga cccctgcccc 
gccggccatg gagaccccgt cccagcggcg cgccacccgc 
cactccgctg tcgcccaccc gcatcacccg gctgcaggag 
caatgatcgc ttggcggtct acatcgaccg tgtgcgctcg 
gctgcgcctt cgcatcaccg agtctgaaga ggtggtcagc 
ggcc 

<210> 138 
<211> 448 
<212> DNA 
<213> Homo sapien 

<400> 138 

gaattcggca cgagcctgtg ttccaggagc cgaatcagaa atgtcatcct caggcacgcc 60 

agacttacct gtcctactca ccgatttgaa gattcaatat actaagatct tcataaacaa 120 

tgaatggcat gattcagtga gtggcaagaa atttcctgtc tttaatcctg caactgagga 180 

ggagctctgc caggtagaag aaggagataa ggaggatgtt gacaaggcag tgaaggccgc 240 

aagacaggct tttcagattg gatccccgtg gcgtactatg gatgcttccg agagggggcg 300 

actattatac aagttggctg atttaatcga aagagatcgt ctgctgctgg ccgacaatgg 360 

agtcaatgaa tggtggaaaa ctctattcca atgcatatct gaatgattta gcaggctgca 420 

tcaaaacatt gcgctactgt gcaggttg 448 

<210> 139 

<211> 510 

<212> DNA 

<213> Homo sapien 

<400> 139 

gaattcggca cgaggttccg tgcagctcac ggagaagcga atggacaaag tcggcaagta 60 

ccccaaggag ctgcgcaagt gctgcgagga cggcatgcgg gagaacccca tgaggttctc 120 

gtgccagcgc cggacccgtt tcatctccct ggcgaggcgt gcaagaaggt cttcctggac 180 

tgctgcaact acatcacaga gctgcggcgg cagcacgcgc gggccagcca cctggcctgc 240 

caggagtaac ctggatgagg acatcattgc agaagagaac atcgtttccc gaagtgagtt 300 

cccagagagc tggctgtgga acgttgagga cttgaaagag ccaccgaaaa atggaatctc 360 

tacgaagctc atgaatatat ttttgaaaga ctccatcacc acgtgggaga ttctggctgt 420 

gagcatgtcg gacaagaaag ggatctgtgt ggcagacccc ttcgaggtca cagtaatgca 480 

ggacttcttc atcgacctgc ggctacccta 510 

<210> 140 
. <211> 360 
<212> DNA 
<213> Homo sapien 

<400> 140 

gaattcggca cgagcggtaa ctaccccggc tgcgcacagc tcggcgctcc ttcccgctcc 60 

ctcacacacc ggcctcagcc cgcaccggca gtagaagatg gtgaaagaaa caacttacta 120 

cgatgttttg ggggtcaaac ccaatgctac tcaggaagaa ttgaaaaagg cttataggaa 180 

actggctttg aagtaccatc ctgataagaa cccaaatgaa ggagagaagt ttaaacagat 240 

ttctcaagct tacgaagttc tctctgatgc aaagaaaagg gaattatatg acaaaggagg 300 

agaacaggca attaaagagg gtggagcagg tggcggtttt ggctccccca tggacatctt 360 

<210> 141 
<211> 483 
<212> DNA 
<213> Homo sapien 



actccgagca gtctctgtcc 120 

gcgggcagcg ctgccaacct 180 

agcggggcgc aggccagctc 240 

aaggaggacc tgcaggagct 300 

ctggaaacgg agaacgcagg 360 

cgcgaggtgt ccggcatcaa 420 
424 



<400> 141 
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gaattcggca cgagagcaga ggctgatctt tgctggaaaa cagctggaag atgggctgca 60 

ccctgtctga ctacaacatc cagaaagagt ccaccctgca cctggtgctc cgtctcagag 120 

gtgggatgca aatcttcgtg aagacactca ctggcaagac catcaccctt gaggtggagc 180 

ccagtgacac catcgagaac gtcaaagcaa agatccagga caaggaaggc attcctcctg 240 

accagcagag gttgatcttt gccggaaagc agctggaaga tgggcgcacc ctgtctgact 300 

acaacatcca gaaagagtct accctgcacc tggtgctccg tctcagaggt gggatgcaga 360 

tcttcgtgaa gaccctgact ggtaagacca tcaccctcga ggtggagccc agtgacacca 420 

tcgagaatgt caaggcaaag atccaagata aggaaggcat tcctcctgat cagcagaggt 480 

tga 483 

<210> 142 
<211> 500 
<212> DNA 
<213> Homo sapien 

<400> 142 

gaattcggca cgaggcggcg acgaccgccg ggagcgtgtg cagcggcggc ggcggaagtg 60 

gccggcgagc ccggtccccg ccggcaccat gcttcccttg tcactgctga agacggctca 120 

gaatcacccc atgttggtgg agctgaaaaa tggggagacg tacaatggac acctggtgag 180 

ctgcgacaac tggatgaaca ttaacctgcg agaagtcatc tgcacgtcca gggacgggga 240 

caagttctgg cggatgcccg agtgctacat ccgcggcagc accatcaagt acctgcgcat 300 

ccccgacgag atcatcgaca tggtcaagga ggaggtggtg gccaagggcc gcggccgcgg 360 

aggcctgcag cagcagaagc agcagaaagg ccgcggcatg ggcggcgctg gccgaggtgt 420 

gtttggtggc cggggccgag gtgggatccc gggcacaggc agaagccagc cagagaagaa 480 

gcctggcaga caggcgggca 500 

<210> 143 
<211> 400 
<212> DNA 
<213> Homo sapien 

<400> 143 

gaattcggca cgagctcgga tgtcagcagg cgtcccaacc cagcaggaac tggctcaatt 60 

ctcagaagaa agcgatcggc cccgaggcag gaaggccggc tccggtgcag ggcgcgccgc 120 

ctgcgggctg cttcgggcca gggtcgaccc gagggccagc gcaagcagcg gcaacaggag 180 

cgccaggagg acatgaggct ctgcctgcag tcagcaactt ggaatattca gacttcagac 240 

cagcatcaca gattataacc ctccgtaaat catctgcatc ccagctccca tcaaaagcca 300 

gcctgaagga cccatggaca cgtgactcca gtgttctcaa caacatctta gatcaagttg 360 

gtttgcacaa catttgcatc tacttgggac aaagcaagaa 400 

<210> 144 

<211> 243 

<212> DNA 

<213> Homo sapien 

<400> 144 

gaattcggca cgagccagct cctaaccgcg agtgatccgc cagcctccgc ctcccgaggt 60 

gcccggattg cagacggagt ctccttcact cagtgctcaa tggtgcccag gctggagtgc 120 

agtggtgtga tctcggctcg ctacaacatc cacctcccag cagcctgcct tggcctccca 180 

aagtgccgag attgcagcct ctgcccggcc gtcaccccgt ctgggaagtg aggagcgttt 240 

ctg " 243 

<210> 145 
<211> 450 
<212> DNA 
<213> Homo sapien 



<400> 145 
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gaattcggca cgaggacagc aggaccgtgg aggccgcggc 

cggcggcggc ggcggtggtg gttacaaccg cagcagtggt 

tggaggtggc cgtggaggca gaggtggcat gggcggaagt 

atttggtggc cctcgggacc aaggatcacg tcatgactcc 

caacaccatc tttgtgcaag gcctgggtga gaatgttaca 

cttcaagcag attggtatta ttaagacaaa caagaaaacg 

gtacacagac agggaaactg gcaagctgaa gggagaggca 

accttcagct aaagcagcct attgactggt 

<210> 146 
<211> 451 
<212> DNA 
<213> Homo sapien 

<400> 146 

gaattcggca cgagccatcg agtccctgcc tttcgacttg 
gcgggagatc gacgcgaaat accaagagat cctgaaggag 
cttcagtcgc gagacagacg gggcgcagaa gcggcggatg 
gctgatccgc accaggagct gggcgacgag aagatccaga 
ctggtggaga accgcacgcg gcaggtggac agccacgtgg 
gagctgggcg acacagcggg caacagcggc aaggctggcg 
gcggcagcgc aggctgacaa gcccaacagc aagcgctcac 
aaccgtgaga acgcgtccag caaccacgac c 

<210> 147 
<211> 400 
<212> DNA 
<213> Homo sapien 

<400> 147 
gaattcggca cgagctcgga tgtcagcagg 
ctcagaagaa agcgatcggc cccgaggcag 
ctgcgggctg cttcgggcca gggtcgaccc 
cgccaggagg acatgaggct ctgcctgcag 
cagcatcaca gattataacc ctccgtaaat 
gcctgaagga cccatggaca cgtgactcca 
gtttgcacaa catttgcatc tacttgggac 

<210> 148 
<211> 503 
<212> DNA 
<213> Homo sapien 

<400> 148 

aaaagaattc ggcacgagcg gcgccgctca tccccctctc ccagcagatt cccactggaa 60 

attcgttgta tgaatcttat tacaagcagg tcgatccggc atacacaggg agggtggggg 120 

cgagtgaagc tgcgcttttt ctaaagaagt ctggcctctc ggacattatc cttgggaaga 180 

tatgggactt ggccgatcca gaaggtaaag ggttcttgga caaacagggt ttctatgttg 240 

cactgagact ggtggcctgt gcacagagtg gccatgaagt taccttgagc aatctgaatt 300 

tgagcatgcc accgcctaaa tttcacgaca ccagcagccc tctgatggtc acaccgccct 360 

ctgcagaggc ccactgggct gtgagggtgg aagaaaaggc caaatttgat gggatttttg 420 

aaagcctctt gcccatcaat ggtttgctct ctggagacaa agtcaagcca gtcctcatga 480 

actcaaagct gcctcttgat gtc 503 

<210> 149 

<211> 1061 

<212> DNA 

<213> Homo sapien 



aggggtggca gtggtggcgg 60 

ggctatgaac ccagaggtcg 120 

gaccgtggtg gcttcaataa 180 

gaacaggata attcagacaa 240 

attgagtctg tggctgatta 300 

ggacagccca tgattaattt 360 

acggtctctt ttgatgaccc 420 
450 



cagagaaatg tctcgctgat 60 

ctagacgagt gctacgagcg 120 

ctgcactgtg tgcagcgcgc 180 

tcgtgagcca gatggtggag 240 

agctgttcga ggcgcagcag 300 

cggacaggcc caaaggcgag 360 

ggcggcagcg caacaacgag 420 
451 



cgtcccaacc cagcaggaac 
gaaggccggc tccggtgcag 
gagggccagc gcaagcagcg 
tcagcaactt ggaatattca 
catctgcatc ccagctccca 
gtgttctcaa caacatctta 
aaagcaagaa 



tggctcaatt 60 

ggcgcgccgc 120 

gcaacaggag 180 

gacttcagac 240 

tcaaaagcca 300 

gatcaagttg 360 
400 
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<400> 149 

gaattcggca cgaggccttt tccagcaacc ccaaggtcca ggtggaggcc atcgaagggg 60 

gagccctgca gaagctgctg gtcatcctgg ccacggagca gccgctcact gcaaagaaga 120 

aggtcctgtt tgcactgtgc tccctgctgc gccacttccc ctatgcccag cggcagttcc 180 

tgaagctcgg ggggctgcag gtcctgagga ccctggtgca ggagaagggc acggaggtgc 240 

tcgccgtgcg cgtggtcaca ctgctctacg acctggtcac ggagaagatg ttcgccgagg 300 

aggaggctga gctgacccag gagatgtccc cagagaagct gcagcagtat cgccaggtac 360 

acctcctgcc aggcctgtgg gaacagggct ggtgcgagat cacggcccac ctcctggcgc 420 

tgcccgagca tgatgcccgt gagaaggtgc tgcagacact gggcgtcctc ctgaccacct 480 

gccgggaccg ctaccgtcag gacccccagc tcggcaggac actggccagc ctgcaggctg 540 

agtaccaggt gctggccagc ctggagctgc aggatggtga ggacgagggc tacttccagg 600 

agctgctggg ctctgtcaac agcttgctga aggagctgag atgaggcccc acaccagtac 660 

tggactggga tgccgctagt gaggctgagg ggtgccagcg tgggtgggct tctcaggcag 720 

gaggacatct tggcagtgct ggcttggcca ttaaatggaa acctgaaggc catcctcttt 780 

ctgctgtgtg tctgtgtaga ctgggcacag ccctgtggcc ggggggtcag gtgagtggtt 840 

gggtgatggg ctctgctgac gtgcagggct cagcccaggg catccaggaa caggctccag 900 

ggcaggaacc tgggcccagg agttgcaagt ctctgcttct taccaagcag cagctctgta 960 

ccttgggaag tcgcttaatt gctctgagct tgtttcctca tctgtcagga gtgccattaa 1020 

aggagaaaaa tcacgtaaaa aaaaaaaaaa aaaaactcga g 1061 

<210> 150 
<211> 781 
<212> DNA 
<213> Homo sapien 



<400> 150 

gaattcggca cgagaaatgg cggcaggggt cgaagcggca gccgaagtgg cggcgacaga 60 

acccaaaatg gaggaagaga gcggcgcgcc ctgcgtgccg agcggcaacg gagctccggg 120 

cccgaagggt gaagaacgac ctactcagaa tgagaagagg aaggagaaaa acataaaaag 180 

aggaggcaat cgctttgagc catattccaa cccaactaaa agatacagag ccttcattac 240 

aaatatacct tttgatgtga aatggcagtc acttaaagac ctggttaaag aaaaagttgg 300 

tgaggtaaca tacgtggagc tcttaatgga cgctgaagga aagtcaaggg gatgtgctgt 360 

tgttgaattc aagatggagg agagcatgaa aaaagctgct gaagttctaa acaagcatag 420 

tctgagtgga aggccactga aagtcaagga agatcctgat ggtgaacatg caaggagagc 480 

aatgcaaaag gctggaagac ttggaagcac agtatttgta gcaaatctgg attataaagt 540 

tggctggaag aaactgaagg aagtatttag tatggctggt gtggtggtcc gagcagacat 600 

tctggaagat aaagatggga aaagtcgtgg aataggcatt gtgacttttg aacagtccat 660 

tgaagctgtg caagcaatat ctatgtttaa tggccagttg ctgtttgata gaccgatgca 720 

cgtcaagatg gatgagaggg ctttaccaaa gggagacttt tttcctcctg aacgccacag 780 

c 781 



<210> 151 
<211> 3275 
<212> DNA 
<213> Homo sapien 



<400> 151 

cttaagtgga tcctgcatca ggagggagca gacaccggag aaagaaaaac aagttgtgct 60 

gtttgaggaa gcaagttgga cctgcactcc agcctgtgga gatgaaccta ggactgtgat 120 

tctgctatcc agtatgttgg ctgaccacag gctcaaactg gaggattata aggatcgcct 180 

gaaaagtgga gagcatctta atccagacca gttggaagct gtagagaaat atgaagaagt 240 

gctacataat ttggaatttg ccaaggagct tcaaaaaacc ttttctgggt tgagcctaga 300 

tctactaaaa gcgcaaaaga aggcccagag aagggagcac atgctaaaac ttgaggctga 360 

gaagaaaaag cttcgaacta tacttcaagt tcagtatgta ttgcagaact tgacacagga 420 

gcacgtacaa aaagacttca aagggggttt gaatggtgca gtgtatttgc cttcaaaaga 480 

acttgactac ctcattaagt tttcaaaact gacctgccct gaaagaaatg aaagtctgag 540 

acaaacactt gaaggatcta ctgtctaaat tgctgaactc aggctatttt gaaagtatcc 600 
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cagttcccaa aaatgccaag gaaaaggaag taccactgga ggaagaaatg ctaatacaat 660 

cagagaaaaa aacacaatta tcgaagactg aatctgtcaa agagtcagag tctctaatgg 720 

aatttgccca gccagagata caaccacaag agtttcttaa cagacgctat atgacagaag 780 

tagattattc aaacaaacaa ggcgaagagc aaccttggga agcagattat gctagaaaac 840 

caaatctccc aaaacgttgg gatatgctta ctgaaccaga tggtcaagag aagaaacagg 900 

agtcctttaa gtcctgggag gcttctggta agcaccagga ggtatccaag cctgcagttt 960 

ccttagaaca gaggaaacaa gacacctcaa aactcaggtc tactctgccg gaagagcaga 1020 

agaagcagga gatctccaaa tccaagccat ctcctagcca gtggaagcaa gatacaccta 1080 

aatccaaagc agggtatgtt caagaggaac aaaagaaaca ggagacacca aagctgtggc 1140 

cagttcagct gcagaaagaa caagatccaa agaagcaaac tccaaagtct tggacacctt 1200 

ccatgcagag cgaacagaac accaccaagt catggaccac tcccatgtgt gaagaacagg 1260 

attcaaaaca gccagagact ccaaaatcct gggaaaacaa tgttgagagt caaaaacact 1320 

ctttaacatc acagtcacag atttctccaa agtcctgggg agtagctaca gcaagcctca 1380 

taccaaatga ccagctgctg cccaggaagt tgaacacaga acccaaagat gtgcctaagc 1440 

ctgtgcatca gcctgtaggt tcttcctcta cccttccgaa ggatccagta ttgaggaaag 1500 

aaaaactgca ggatctgatg actcagattc aaggaacttg taactttatg caagagtctg 1560 

ttcttgactt tgacaaacct tcaagtgcaa ttccaacgtc acaaccgcct tcagctactc 1620 

caggtagccc cgtagcatct aaagaacaaa atctgtccag tcaaagtgat tttcttcaag 1680 

agccgttaca ggtatttaac gttaatgcac ctctgcctcc acgaaaagaa caagaaataa 1740 

aagaatcccc ttattcacct ggctacaatc aaagttttac cacagcaagt acacaaacac 1800 

caccccagtg ccaactgcca tctatacatg tagaacaaac tgtccattct caagagactg 1860 

cagcaaatta tcatcctgat ggaactattc aagtaagcaa tggtagcctt gccttttacc 1920 

cagcacagac gaatgtgttt cccagaccta ctcagccatt tgtcaatagc cggggatctg 1980 

ttagaggatg tactcgtggt gggagattaa taaccaattc ctatcggtcc cctggtggtt 2040 

ataaaggttt tgatacttat agaggactcc cttcaatttc caatggaaat tatagccagc 2100 

tgcagttcca agctagagag tattctggag caccttattc ccaaagggat aatttccagc 2160 

agtgttataa gcgaggaggg acatctggtg gtccacgagc aaattcgaga gcagggtgga 2220 

gtgattcttc tcaggtgagc agcccagaaa gagacaacga aacctttaac agtggtgact 2280 

ctggacaagg agactcccgt agcatgaccc ctgtggatgt gccagtgaca aatccagcag 2340 

ccaccatact gccagtacac gtctaccctc tgcctcagca gatgcgagtt gccttctcag 2400 

cagccagaac ctctaatctg gcccctggaa ctttagacca acctattgtg tttgatcttc 2460 

ttctgaacaa cttaggagaa acttttgatc ttcagcttgg tagatttaat tgcccagtga 2520 

atggcactta cgttttcatt tttcacatgc taaagctggc agtgaatgtg ccactgtatg 2580 

tcaacctcat gaagaatgaa gaggtcttgg tatcagccta tgccaatgat ggtgctccag 2640 

accatgaaac tgctagcaat catgcaattc ttcagctctt ccagggagac cagatatggt 2700 

tacgtctgca caggggagca atttatggaa gtagctggaa atattctacg ttttcaggct 2760 

atcttcttta tcaagattga aagtcagtac agtattgaca ataaaaggat ggtgttctaa 2820 

ttagtgggat tgaaggaaaa gtagtctttg ccctcatgac tgattggttt aggaaaatgt 2880 

ttttgttcct agagggagga ggtccttact tttttgtttt ccttcctgag gtgaaaaatc 2940 

aagctgaatg acaattagca ctaatctggc actttataaa ttgtgatgta gcctcgctag 3000 

tcaagctgtg aatgtatatt gtttgcactt aatccttaac tgtattaacg ttcagcttac 3060 

taaactgact gcctcaagtc caggcaagtt acaatgcctt gttgtgcctc aataaaaaag 3120 

ttacatgcaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaac tcgag 3275 

<210> 152 
<211> 2179 
<212> DNA 
<213> Homo sapien 

<400> 152 

gaattcggca ccaggcacta ttaaatgtga ggcagcctcc atctactaca acatttgtgc 60 

tgaatcaaat aaatcatctt ccacccttgg gatctacaat tgtaatgact aaaacaccac 120 

ctgtaacaac caacaggcaa accatcactt taactaagtt tatccagact actgcaagca 180 

cacgcccgtc agtctcagca ccaacagtac gaaatgccat gacctctgca ccttcaaaag 240 

accaagttca gcttaaagat ctactgaaaa ataatagtct taatgaactg atgaaactaa 300 

agccacctgc taatattgct cagccagtag caacagcagc tactgatgta agcaatggta 360 
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cagtaaagaa agagtcttct aataaagaag gagctagaat gtggataaac gacatgaaga 420 

tgaggagttt ttccccaacc atgaaggttc ctgttgtaaa agaagatgat gaaccagagg 480 

aagaagatga agaagaaatg ggtcatgcag aaacctatgc agaatacatg ccaataaaat 540 

taaaaattgg cctacgtcat ccagatgctg tagtggaaac cagctcttta tccagtgtta 600 

ctcctcctga tgtttggtac aaaacatcca tttctgagga aaccattgat aatggctggt 660 

tatcagcatt gcagcttgag gcaattacat atgcagccca gcaacatgaa actttcctac 720 

ctaatggaga tcgtgctggc ttcttaatag gtgatggtgc cggtgtagga aaaggaagga 780 

cgatagcagg aatcatctat gaaaattatt tgttgagtag aaaacgagca ttgtggttta 840 

gtgtttcaaa tgacttaaag tatgatgctg aaagagattt aagggatatt ggagcaaaaa 900 

acattttggt tcattcgtta aataagttta aatacggaaa aatttcttcc aaacataatg 960 

ggagtgtgaa aaagggtgtt atttttgcta cttactcttc acttattggt gaaagccagt 1020 

ctggcggcaa gtataaaact aggttaaaac aacttctgca ttggtgcggt gatgacttcg 1080 

atggagtgat agtgtttgat gagtgtcata aagccaaaaa cttatgtcct gttggttctt 1140 

caaagccaac caagacaggc ttagcagttt tagagcttca gaacaaattg ccaaaagcca 1200 

gagttgttta tgctagtgca actggtgctt ctgaaccacg caacatggcc tatatgaacc 1260 

gtcttggcat atggggtgag ggtactccat ttagagaatt cagtgatttt attcaagcag 1320 

tagaacggag aggagttggt gccatggaaa tagttgctat ggatatgaag cttagaggaa 1380 

tgtacattgc tcgacaactg agctttactg gagtgacctt caaaattgag gaagttcttc 1440 

tttctcagag ctacgttaaa atgtataaca aagctgtcaa gctgtgggtc attgccagag 1500 

agcggtttca gcaagctgca gatctgattg atgctgagca acgaatgaag aagtccatgt 1560 

ggggtcagtt ctggtctgct caccagaggt tcttcaaata cttatgcata gcatccaaag 1620 

ttaaaagggt tgtgcaacta gctcgagagg aaatcaagaa tggaaaatgt gttgtaattg 1680 

gtctgcagtc tacaggagaa gctagaacat tagaagcttt ggaagagggc gggggagaat 1740 

tgaatgattt tgtttcaact gccaaaggtg tgttgcagtc actcattgaa aaacattttc 1800 

ctgctccaga caggaaaaaa ctttatagtt tactaggaat cgatttgaca gctccaagta 1860 

acaacagttc gccaagagat agtccttgta aagaaaataa aataaagaag cggaaaggtg 1920 

aagaaataac tcgagaagcc aaaaaagcac gaaaagtagg tggccttact ggtagcagtt 1980 

ctgacgacag tggaagtgaa tctgatgcct ctgataatga agaaagtgac tatgagagct 2040 

ctaaaaacat gagttctgga gatgatgacg atttcaaccc atttttagat gagtctaatg 2100 

aggatgatga aaatgatccc tggttaatta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2160 

aaaaaaaaaa aaactcgag 2179 

<210> 153 
<211> 2109 
<212> DNA 
<213> Homo sapien 

<400> 153 

cagagagccc caggcatcga ggagaaggcg gcggagaatg gggccctggg gtcccccgag 60 

agagaagaga aagtgctgga gaatggggag ctgacacccc caaggaggga ggagaaagcg 120 

ctggagaatg gggagctgag gtccccagag gccggggaga aggtgctggt gaatgggggc 180 

ctgacacccc caaagagcga ggacaaggtg tcagagaatg ggggcctgag attccccagg 240 

aacacggaga ggccaccaga gactgggcct tggagagccc cagggccctg ggagaagacg 300 

cccgagagtt ggggtccagc ccccacgatc ggggagccag ccccagagac ctctctggag 360 

agagcccctg cacccagcgc agtggtctcc tcccggaacg gcggggagac agcccctggc 420 

ccccttggcc cagcccccaa gaacgggacg ctggaacccg ggaccgagag gagagccccc 480 

gagactgggg gggcgccgag agccccaggg gctgggaggc tggacctcgg gagtgggggc 540 

cgagccccag tgggcacggg gacggccccc ggcggcggcc ccggaagcgg cgtggacgca 600 

aaggccggat gggtagacaa cacgaggccg cagccaccgc cgccaccgct gccaccgcca 660 

ccggaggcac agccgaggag gctggagcca gcgcccccga gagccaggcc ggaggtggcc 720 

cccgagggag agcccggggc cccagacagc agggccggcg gagacacggc actcagcgga 780 

gacggggacc cccccaagcc cgagaggaag ggccccgaga tgccacgact attcttggac 840 

ttgggacccc ctcaggggaa cagcgagcag atcaaagcca ggctctcccg gctctcgctg 900 

gcgctgccgc cgctcacgct cacgccattc ccggggccgg gcccgcggcg gcccccgtgg 960 

gagggcgcgg acgccggggc ggctggcggg gaggccggcg gggcgggagc gccggggccg 1020 

gcggaggagg acggggagga cgaggacgag gacgaggagg aggacgagga ggcggcggcg 1080 

ccgggcgcgg cggcggggcc gcggggcccc gggagggcgc gagcagcccc ggtgcccgtc 1140 

gtggtgagca gcgccgacgc ggacgcggcc cgcccgctgc gggggctgct caagtctccg 1200 
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cgcggggccg acgagccaga ggacagcgag ctggagagga agcgcaagat ggtctccttc 1260 

cacggggacg tgaccgtcta cctcttcgac caggagacgc caaccaacga gctgagcgtc 1320 

caggcccccc ccgaggggga cacggacccg tcaacgcctc cagcgccccc gacacctccc 1380 

caccccgcca cccccggaga tgggtttccc agcaacgaca gcggctttgg aggcagtttc 1440 

gagtgggcgg aggatttccc cctcctcccc cctccaggcc ccccgctgtg cttctcccgc 1500 

ttctccgtct cgcctgcgct ggagaccccg gggccacccg cccgggcccc cgacgcccgg 1560 

cccgcaggcc ccgtggagaa ttgattcccc gaagacccga ccccgctgca ccctcagaag 1620 

aggggttgag aatggaatcc tctgtggatg acggcgccac tgccaccacc gcagacgccg 1680 

cctctgggga ggcccccgag gctgggccct ccccctccca ctcccctacc atgtgccaaa 1740 

cgggaggccc cgggcccccg cccccccagc cccccagatg gctcccctga cccccctgac 1800 

cccctcggag ccaaatgagg caggaatccc cccgcccctc catagagagc cgcctttctc 1860 

ggaactgaac tgaactcttt tgggcctgga gcccctcgac acagcggagg tccctcctca 1920 

cccactcctg gcccaagaca ggggccgcag gcttcgggga cccggacccc ccatttcgcg 1980 

tctccccttt ccctccccag cccggcccct ggaggggcct ctggttcaaa ccttcgcgtg 2040 

gcattttcac attatttaaa aaagacaaaa acaacttttt ggaggaaaaa aaaaaaaaaa 2100 

aaactcgag 2109 

<210> 154 
<211> 1411 
<212> DNA 
<213> Homo sapien 

<400> 154 

gaattcggca ccaggggaga tgaggaagtt cgatgttcct agcatggagt ctacccttaa 60 

ccagccagcc atgctagaga cgttatactc agatccacat taccgagccc atttccccaa 120 

cccaagacct gatacaaata aggatgtata caaagtattg ccagaatcca agaaggcacc 180 

gggcagtggt gcagtatttg agaggaacgg accacatgct agcagtagtg gggtgctccc 240 

tttgggactc cagcctgcgc ctggactt.tc caagtcacta tcctctcagg tgtggcaacc 300 

aagtcctgac ccttggcatc ctggagaaca atcctgtgaa ctcagtactt gtcgacagca 360 

gttggaattg atccgtttac agatggagca aatgcagctt cagaacggag ccatgtgtca 420 

ccatcctgct gctttcgctc cattactgcc caccctagag ccagcacagt ggctcagcat 480 

cctgaacagt aacgagcatc tcctgaagga gaaggagctc ctcattgaca agcaaaggaa 540 

gcatatctct cagctggagc agaaagtgcg agagagtgaa ctgcaagtcc acagtgccct 600 

tttgggccgc cctgccccct ttggggatgt ctgcttattg aggctacagg agttgcagcg 660 

agagaacact ttcttacggg cacagtttgc acagaagaca gaagccctga gcaaggagaa 720 

gatggagctt gaaaagaaac tctctgcatc tgaagttgaa attcagctca ttagggagtc 780 

tctaaaagtg acactacaga agcattcgga ggaggggaag aaacaggagg aaagggtcaa 840 

aggtcgtgat aaacatatca ataatttgaa aaagaaatgt cagaaggaat cagagcagaa 900 

ccgggagaag cagcagcgta ttgaaacctt ggagcgctat ctagctgacc tgcccaccct 960 

agaagaccat cagaaacaga cggagcagct taaggacgct gaattaaaga acacagaact 1020 

gcaagagaga gtggctgagc tggagacttt gctggaggac acccaggcaa cctgcagaga 1080 

gaaggaggtt cagctggaaa gtctgagaca aagagaagca gacctctcct ctgctagaca 1140 

taggtaatgc cctgtgtact tgggggaagg agggagttcg gttctggtgc tctgttaact 1200 

cttgtgtgtt caacagtgtt catttcaagt tcctttcttc taagagcttt gtgttctttg 1260 

aattgaaagt cacttatggc cgggtgtggt ggcgcacacc tttaatccca gcacttggga 1320 

gtcagaggca ggctaatttc tgagtttcag gacagccagg gctatacaga gaaaccctgt 1380 

ctcaaacaaa aaaaaaaaaa aaaaactcga g 1411 

<210> 155 
<211> 678 
<212> DNA 
<213> Homo sapien 

<400> 155 

ctggagtgaa gggagctagt ggtaaaggga gctggtggag gggtggcggc aggggtaagg 60 

ggcaggggac accctctaga cggagagcgg gctccgaggt cctggctggc cctcggtgcg 120 

cccgcccctg tgttggtccc acaatccctg gcaatgagag gccagggttt attggacaga 180 

gtcagttgtg gggttcagag ggtcagcaat caatcaatcc tccgaatcca gagatttaga 240 
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cccagtcgtc cgtattagga 
ggaacctgtc ttttgatttg 
ccccctaggc acagggggtg 
cgaggcattc ttcgtgctca 
ggaagcccac cgacgccagc 
gcaatccctc aactacggct 
tgtggagaaa cccagcacat 
tcctgaagcc ttgaagcc 



ctggaggggg gtcaataggt 
gggttcaaca tacagagttc 
gtctttacca ttttcgagac 
atgctgatgt ctgctccgac 
gtcacccttt cttactcctt 
gcatcgttga gaatccacag 
ttatcagtga catcatcaac 



tcagtgtttg agatgccaag 
aggtacctgc aggaatttgc 
cagatcctgg ctgggagccc 
ttccccttga gtgctatgtt 
ggcactacgg ctaacaggac 
acacacgagg tattgcacta 
tgcggcacct acctcttttc 



300 
360 
420 
480 
540 
600 
660 
678 



<210> 156 
<211> 2668 
<212> DNA 
<213> Homo sapien 

<400> 156 

gggaaggcgg ctgcgctgct gggcgggggc gggagctgga gccggagctg gagccggggc 60 

cggggcccgg gtcagcgctt gagccgggag aagagtttga gatcgtggac cgaagccagc 120 

tgcccggccc aggcgacctg cggagcgcaa cgaggccgcg ggcggccgag ggctggtcgg 180 

cgcccatcct gaccctggca cgcagggcca ccgggaacct gtcggcgagc tgcgggagcg 240 

cgctgcgcgc ggccgcgggg ctgggcggcg gggacagcgg ggacggcacg gcgcgcgcag 300 

cttctaagtg ccagatgatg gaggagcgtg ccaacctgat gcacatgatg aaactcagca 360 

tcaaggtgtt gctccagtcg gctctgagcc tgggccgcag cctggatgcg gaccatgccc 420 

ccttgcagca gttctttgta gtgatggagc actgcctcaa acatgggctg aaagttaaga 480 

agagttttat tggccaaaat aaatcattct ttggtccttt ggagctggtg gagaaacttt 540 

gtccagaagc atcagatata gcgactagtg tcagaaatct tccagaatta aagacagctg 600 

tgggaagagg ccgagcgtgg ctttatcttg cactcatgca aaagaaactg gcagattatc 660 

tgaaagtgct tatagacaat aaacatctct taagcgagtt ctatgagcct gaggctttaa 720 

tgatggagga agaagggatg gtgattgttg gtctgctggt gggactcaat gttctcgatg 780 

ccaatctctg cttgaaagga gaagacttgg attctcaggt tggagtaata gatttttccc 840 

tctaccttaa ggatgtgcag gatcttgatg gtggcaagga gcatgaaaga attactgatg 900 

tccttgatca aaaaaattat gtggaagaac ttaaccggca cttgagctgc acagttgggg 960 

atcttcaaac caagatagat ggcttggaaa agactaactc aaagcttcaa gaagagcttt 1020 

cagctgcaac agaccgaatt tgctcacttc aagaagaaca gcagcagtta agagaacaaa 1080 

atgaattaat tcgagaaaga agtgaaaaga gtgtagagat aacaaaacag gataccaaag 1140 

ttgagctgga gacttacaag caaactcggc aaggtctgga tgaaatgtac agtgatgtgt 1200 

ggaagcagct aaaagaggag aagaaagtcc ggttggaact ggaaaaagaa ctggagttac 1260 

aaattggaat gaaaaccgaa atggaaattg caatgaagtt actggaaaag gacacccacg 1320 

agaagcagga cacactagtt gccctccgcc agcagctgga agaagtcaaa gcgattaatt 1380 

tacagatgtt tcacaaagct cagaatgcag agagcagttt gcagcagaag aatgaagcca 1440 

tcacatcctt tgaaggaaaa accaaccaag ttatgtccag catgaaacaa atggaagaaa 1500 

ggttgcagca ctcggagcgg gcgaggcagg gggctgagga gcggagccac aagctgcagc 1560 

aggagctggg cgggaggatc ggcgccctgc agctgcagct ctcccagctg cacgagcaat 1620 

gctcaagcct ggagaaagaa ttgaaatcag aaaaagagca aagacaggct cttcagcgcg 1680 

aattacagca cgagaaagac acttcctctc tactcaggat ggagctgcaa caagtggaag 1740 

gactgaaaaa ggagttgcgg gagcttcagg acgagaaggc agagctgcag aagatctgcg 1800 

aggagcagga acaagccctc caggaaatgg gcctgcacct cagccagtcc aagctgaaga 1860 

tggaagatat aaaagaagtg aaccaggcac tgaagggcca cgcctggctg aaagatgacg 1920 

aagcgacaca ctgtaggcag tgtgagaagg agttctccat ttcccggaga aagcaccact 1980 

gccggaactg tggccacatc ttctgcaaca cctgctccag caacgagctg gccctgccct 2040 

cctaccccaa gccggtgcga gtgtgcgaca gctgccacac cctgctcctg cagcgctgct 2100 

cctccacggc ctcctgaacg tccgtcctca ggagcacagc ctcacggaca gtgccaaacc 2160 

ctgtgggtct ccaggggctt gggaaatgtg ttctttccca agagtatcaa aggaaagaat 2220 

caaatttctt gcccggtcac tggcactcca gaagacagcg tgccggaacc ggcagctctc 2280 

acctttctgt gacttgttcg gaattaactc ctctggatgg aaacttccat cttacttggt 2340 

tacatcacgg ctctggttca gatacaactt catgattttg ctactatcat ttttcacttt 2400 

tcaaagaatt taacctattt tacagcagtt cagttctgct agtgagtagt tttcctctcc 2460 

taccttcctt ctaaaaacct gattcatgca cagcgtttga cacacatgga gtctgccagt 2520 

gtgccttctc tgcttcagac aagagatctg ccatttcatg cccttgtgac tacctatcat 2580 
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tggccctgca ataaaatcat ttatttttca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2640 



<210> 157 
<211> 2313 
<212> DNA 
<213> Homo sapien 

<400> 157 

gaattcggca ccaggccggg cgggcgcctc agccatggcc ctgcgcaagg aactgctcaa 60 

gtccatctgg tacgccttta ccgcgctgga cgtggagaag agtggcaaag tctccaagtc 120 

ccagctcaag gtgctgtccc acaacctgta cacggtcctg cacatccccc atgaccccgt 180 

ggccctggag gaacacttcc gagatgatga tgacggccct gtgtccagcc agggatacat 240 

gccctacctc aacaagtaca tcctggacaa ggtggaggag ggggcttttg ttaaagagca 300 

ctttgatgag ctgtgctgga cgctgacggc caagaagaac tatcgggcag atagcaacgg 360 

gaacagtatg ctctccaatc aggatgcctt ccgcctctgg tgcctcttca acttcctgtc 420 

tgaggacaag taccctctga tcatggttcc tgatgaggtg gaatacctgc tgaaaaaggt 480 

actcagcagc atgagcttgg aggtgagctt gggtgagctg gaggagcttc tggcccagga 540 

ggcccaggtg gcccagacca ccggggggct cagcgtctgg cagttcctgg agctcttcaa 600 

ttcgggccgc tgcctgcggg gcgtgggccg ggacaccctc agcatggcca tccacgaggt 660 

ctaccaggag ctcatccaag atgtcctgaa gcagggctac ctgtggaagc gagggcacct 720 

gagaaggaac tgggccgaac gctggttcca gctgcagccc agctgcctct gctactttgg 780 

gagtgaagag tgcaaagaga aaaggggcat tatcccgctg gatgcacact gctgcgtgga 840 

ggtgctgcca gaccgcgacg gaaagcgctg catgttctgt gtgaagacag ccacccgcac 900 

gtatgagatg agcgcctcag acacgcgcca gcgccaggag tggacagctg ccatccagat 960 

ggcgatccgg ctgcaggccg aggggaagac gtccctacac aaggacctga agcagaaacg 1020 

gcgcgagcag cgggagcagc gggagcggcg ccgggcggcc aaggaagagg agctgctgcg 1080 

gctgcagcag ctgcaggagg agaaggagcg gaagctgcag gagctggagc tgctgcagga 1140 

ggcgcagcgg caggccgagc ggctgctgca ggaggaggag gaacggcgcc gcagccagca 1200 

ccgcgagctg cagcaggcgc tcgagggcca actgcgcgag gcggagcagg cccgggcctc 1260 

catgcaggct gagatggagc tgaaggagga ggaggctgcc cggcagcggc agcgcatcaa 1320 

ggagctggag gagatgcagc agcggttgca ggaggccctg caactagagg tgaaagctcg 1380 

gcgagatgaa gaatctgtgc gaatcgctca gaccagactg ctggaagagg aggaagagaa 1440 

gctgaagcag ttgatgcagc tgaaggagga gcaggagcgc tacatcgaac gggcgcagca 1500 

ggagaaggaa gagctgcagc aggagatggc acagcagagc cgctccctgc agcaggccca 1560 

gcagcagctg gaggaggtgc ggcagaaccg gcagagggct gacgaggatg tggaggctgc 1620 

ccagagaaaa ctgcgccagg ccagcaccaa cgtgaaacac tggaatgtcc agatgaaccg 1680 

gctgatgcat ccaattgagc ctggagataa gcgtccggtc acaagcagct ccttctcagg 1740 

cttccagccc cctctgcttg cccaccgtga ctcctcccta aagcgcctga cccgctgggg 1800 

atcccagggc aacaggaccc cctcgcccaa cagcaatgag cagcagaagt ccctcaatgg 1860 

tggggatgag gctcctgccc cggcttccac ccctcaggaa gataaactgg atccagcacc 1920 

agaaaattag cctctcttag ccccttgttc ttcccaatgt catatccacc aggacctggc 1980 

cacagctggc ctgtgggtga tcccagctct tactaggaga gggagctgag gtcctggtgc 2040 

caggggccca ggccctccaa ccataaacag tccaggatgg aacctggttc acccttcata 2100 

ccagctccaa gccccagacc atgggagctg tctgggatgt tgatccttga gaacttggcc 2160 

ctgtgcttta gacccaagga cccgattcct gggctaggaa agagagaaca agcaagccgg 2220 

ggctacctgc ccccaggtgg ccaccaagtt gtggaagcac atttctaaat aaaaactgct 2280 

cttagaatga aaaaaaaaaa aaaaaaactc gag 2313 

<210> 158 
<211> 2114 
<212> DNA 
<213> Homo sapien 



aaaaaaaaaa aaaaaaaaaa aactcgag 



2668 



<400> 158 



gaattcggca cgaggaagaa ctcgcctctg ttgagtgtaa 
gagaataaca gaaatgtcca tttggagcac tcagagcaga 
gacacctcag cagcgcacca ggtggtttta ggagaaaact 



gtagccaaac aataaccaag 
atcctggttc atcagcaggt 
tgatagccac agccctttgt 



60 
120 
180 
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ctttctggca gtgggtctca gtctgatttg aaggatgtgg ccagcacagc aggagaggag 240 

ggggacacaa gccttcggga gagcctccat ccagtcactc ggtctcttaa ggcagggtgc 300 

catactaagc agcttgcctc caggaattgc tctgaagaga aatccccaca aacctccatc 360 

ctaaaggaag gtaacaggga cacaagcttg gatttccgac ctgtagtgtc tccagcaaat 420 

ggggttgaag gagtccgagt ggatcaggat gatgatcaag atagctcttc cctgaagctt 480 

tctcagaaca ttgctgtaca gactgacttt aagacagctg attcagaggt aaacacagat 540 

caagatattg aaaagaattt ggataaaatg atgacagaga gaaccctgtt gaaagagcgt 600 

taccaggagg tcctggacaa acagaggcaa gtggagaatc agctccaagt gcaattaaag 660 

cagcttcagc aaaggagaga agaggaaatg aagaatcacc aggagatatt aaaggctatt 720 

caggatgtga caataaagcg ggaagaaaca aagaagaaga tagagaaaga gaagaaggag 780 

tttttgcaga aggagcagga tctgaaagct gaaattgaga agctttgtga gaagggcaga 840 

agagaggtgt gggaaatgga actggataga ctcaagaatc aggatggcga aataaatagg 900 

aacattatgg aagagactga acgggcctgg aaggcagaga tcttatcact agagagccgg 960 

aaagagttac tggtactgaa actagaagaa gcagaaaaag aggcagaatt gcaccttact 1020 

tacctcaagt caactccccc aacactggag acagttcgtt ccaaacagga gtgggagacg 1080 

agactgaatg gagttcggat aatgaaaaag aatgttcgtg accaatttaa tagtcatatc 1140 

cagttagtga ggaacggagc caagctgagc agccttcctc aaatccctac tcccacttta 1200 

cctccacccc catcagagac agacttcatg cttcaggtgt ttcaacccag tccctctctg 1260 

gctcctcgga tgcccttctc cattgggcag gtcacaatgc ccatggttat gcccagtgca 1320 

gatccccgct ccttgtcttt cccaatcctg aaccctgccc tttcccagcc cagccagcct 1380 

tcctcacccc ttcctggctc ccatggcaga aatagccctg gcttgggttc ccttgtcagc 1440 

cctggtgccg aattcggcac gaggtaccac tggtctgtgt gctagaggag ggtgttgcca 1500 

tagaaccagt ggccacagtt gtggtggtgg tggtcagcac tgtgggggtg tgggtggtcc 1560 

ccgggacgga ggagggggtc accgtgaagc cactggttgt gggtgtggtg gttgtgctga 1620 

tccacactgg aggcgtgcgt gccgtccctg ggctgaagga gggggtgact gtgaagcccg 1680 

tggttgtggt agtcggcact ttggtagtgt gagctgttcc tggggtggaa gagggggtgg 1740 

ccacagagcc ggtggccctg gttgtggtgg ccgtggtggt aagcactgtg gaggtgtggg 1800 

cagtctctgg agtggaggag ggtgtggctg tggacatggt ggccgtgggt gtggtggtct 1860 

gtgataggcg ggtccaggtg gtgcccaggg aggaggaggg gatggctgta aagctggtag 1920 

ctgtgggtgt ggtggctgtg cttctcagtg ctggaagggc ggttgcagtc cctggactgg 1980 

agaagggagt ggctttggag ctggtgactg tgggtgtcgt ggccgtggtg ctcacatgtg 2040 

gggtgccagc agttgcctgg gtggaggagg cggtggccgt ggatccggtg ggcaccgtca 2100 

cgggagtact tcta 2114 

<210> 159 
<211> 278 
<212> DNA 
<213> Homo sapien 

<400> 159 

gaattcggca caggtaactt tgcctggggt atttaaaaaa aaaaaaaaaa aaaaaaaaag 60 

tcaaatatct gagtactaat ttcctgaaaa gtatgttccg atagatgaac agatcattaa 120 

tgcagaatga gaatcactcc taaaataggt aatggtaaaa attaaattga caattacctc 180 

tctctatgca gaaggaaata tcacctatat gacatcatca tcatctattg atacttgctg 240 

gcagtgctaa taatggtttt aatgccaatt tgtaagaa 278 

<210> 160 
<211> 848 
<212> DNA 
<213> Homo sapien 

<400> 160 

gaattcggca cgagccccag aggagctcgg cctgcgctgc gccacgatgt ccggggagtc 60 

agccaggagc ttggggaagg gaagcgcgcc cccggggccg gtcccggagg gctcgatccg 120 

catctacagc atgaggttct gcccgtttgc tgagaggacg cgtctagtcc tgaaggccaa 180 

gggaatcagg catgaagtca tcaatatcaa cctgaaaaat aagcctgagt ggttctttaa 240 

gaaaaatccc tttggtctgg tgccagttct ggaaaacagt cagggtcagc tgatctacga 300 

gtctgccatc acctgtgagt acctggatga agcataccca gggaagaagc tgttgccgga 360 
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tgacccctat gagaaagctt gccagaagat gatcttagag ttgttttcta aggtgccajtc 420 

cttggtagga agctttatta gaagccaaaa taaagaagac tatgctggcc taaaagaaga 480 

atttcgtaaa gaatttacca agctagagga ggttctgact aataagaaga cgaccttctt 540 

tggtggcaat tctatctcta tgattgatta cctcatctgg ccctggtttg aacggctgga 600 

agcaatgaag ttaaatgagt gtgtagacca cactccaaaa ctgaaactgt ggatggcagc 660 

catgaaggaa gatcccacag tctcagccct gcttactagt gagaaagact ggcaaggttt 720 

cctagagctc tacttacaga acagccctga ggcctgtgac tatgggctct gaagggggca 780 

ggagtcagca ataaagctat gtctgatatt ttccttcact aaaaaaaaaa aaaaaaaaaa 840 

aactcgag 848 

<210> 161 

<211> 432 

<212> DNA 

<213> Homo sapien 

<400> 161 

gaattcggca cgagggcaga ccaagatcct ggaggaggac ctggaacaga tcaagctgtc 60 

cttgagagag cgaggccggg agctgaccac tcagaggcag ctgatgcagg aacgggcaga 120 

ggaagggaag ggcccaagta aagcacagcg cgggagccta gagcacatga agctgatcct 180 

gcgtgataag gagaaggagg tggaatgtca gcaggagcat atccatgaac tccaggagct 240 

caaagaccag ctggagcagc agctccaggg cctgcacagg aaggtaggtg agaccagcct 300 

cctcctgtcc cagcgagagc aggaaatagt ggtcctgcag cagcaactgc aggaagccag 360 

ggaacaaggg gagctgaagg agcagtcact tcagagtcaa ctggatgagg cccagagagc 420 

cctagcccag ag 432 

<210> 162 
<211> 433 
<212> DNA 
<213> Homo sapien 

<400> 162 

gattcggcac gagccggagc tgggttgctc ctgctcccgt ctccaagtcc tggtacctcc 60 

ttcaagctgg gagagggctc tagtccctgg ttctgaacac tctggggttc tcgggtgcag 120 

gccgccatga gcaaacggaa ggcgccgcag gagactctca acgggggaat caccgacatg 180 

ctcacagaac tcgcaaactt tgagaagaac gtgagccaag ctatccacaa gtacaatgct 240 

tacagaaaag cagcatctgt tatagcaaaa tacccacaca aaataaagag tggagctgaa 300 

gctaagaaat tgcctggagt aggaacaaaa attgctgaaa agattgatga gtttttagca 360 

actggaaaat tacgtaaact ggaaaagatt cggcaggatg atacgagttc atccatcaat 420 

ttcctgactc gag 433 

. <210> 163 
<211> 432 
<212> DNA 
<213> Homo sapien 

<400> 163 

gaattcggca ccagatgagg ccaacgaggt gacggacagc gcgtacatgg gctccgagag 60 

cacctacagt gagtgtgaga ccttcacgga cgaggacacc agcaccctgg tgcaccctga 120 

gctgcaacct gaaggggacg cagacagtgc cggcggctcg gccgtgccct ctgagtgcct 180 

ggacgccatg gaggagcccg accatggtgc cctgctgctg ctcccaggca ggcctcaccc 240 

ccatggccag tctgtcatca cggtgatcgg gggcgaggag cactttgagg actacggtga 300 

aggcagtgag gcggagctgt ccccagagac cctatgcaac gggcagctgg gctgcagtga 360 

ccccgctttc ctcacgccca gtccgacaaa gcggctctcc agcaagaagg tggcaaggta 420 

cctgcaccag tc 432 

<210> 164 
<211> 395 
<212> DNA 
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<213> Homo sapien 



<400> 164 
gacacttgaa tcatgggtga cgttaaaaat 
atgaccccat cctatgaaat tagagcagtg 
gaggttcagg tggaaggtta taattacact 
gcacaaagca atgctgccag agactttgtt 
agtgaagaag ttccagcttt tggggtagca 
actacagcaa atgctgaagg catcttgttg 
ggttcctgaa aaaaaaaaaa aaaaaaaaac 

<210> 165 

<211> 503 

<212> DNA 

<213> Homo sapien 

<400> 165 

gaattcggca ccaggaacgc tcggtgagag gcggaggagc ggtaactacc ccggttgcgc 60 

acagctcggc gctccttccc gctccctcac acaccggcct cagcccgcac cggcagtaga 120 

agatggtgaa agaaacaact tactacgatg ttttgggggt caaacccaat gctactcagg 180 

aagaattgaa aaaggcttat aggaaactgg ccttgaagta ccatcctgat aagaacccaa 240 

atgaaggaga gaagtttaaa cagatttctc aagcttacga agttctctct gatgcaaaga 300 

aaagggaatt atatgacaaa ggaggagaac aggcaattaa agagggtgga gcaggtggcg 360 

gttttggctc ccccatggac atctttgata tgttttttgg aggaggagga aggatgcaga 420 

gagaaaggag aggtaaaaat gttgtacatc agctctcagt aaccctagaa gacttatata 480 

atggtgcaac aagaaaactg get 503 

<210> 166 
<211> 893 
<212> DNA 
<213> Homo sapien 

<400> 166 

gaattcggca cgagaggaac ttctcttgac gagaagagag accaaggagg ccaagcaggg 60 

gctgggccag aggtgecaac atggggaaac tgaggctegg cteggaaggg tgagagtgag 120 

actacatctc aaaaaaaaaa aaaaaaaaaa aaaagaaaga aaagaaaaga aaaaagaaag 180 

aacggaagta gttgtaggta gtggtatggt ggtatgagtc tgttttctgt tacttataac 240 

aacaacaaca acaaaaaacg ctgaaactgg gtaatttata aagaaaagga aaaaaagcag 300 

aaaaaaatca ggaagaagag aaaggaaaag aagacaaata aatgaaattt atgtattaca 360 

gttctgaagg ctgagacatc ccaggtcaag ggtccacact tggegaggge tttcttgctg 420 

gtggagactc tttgtggagt cctgggacag tgcagaagga tcacgcctcc ctaccgctcc 480 

aagcccagcc ctcagccatg gcatgccccc tggatcaggc cattggcctc ctcgtggcca 540 

tcttccacaa gtactccggc agggagggtg acaagcacac cctgagcaag aaggagctga 600 

aggagctgat ccagaaggag ctcaccattg gctcgaagct geaggatget gaaattgcaa 660 

ggctgatgga agacttggac eggaacaagg accaggaggt gaacttccag gagtatgtca 720 

ccttcctggg ggccttggct ttgatctaca atgaagcect caagggctga aaataaatag 780 

ggaagatgga gacaccctct gggggtcctc tctgagtcaa atccagtggt gggtaattgt 840 

acaataaatt ttttttggtc aaatttaaaa aaaaaaaaaa aaaaaaactc gag 893 

<210> 167 
<211> 549 
<212> DNA 
<213> Homo sapien 

<400> 167 

gaattcggca cgagcccaga tcccgaggtc cgacagcgcc cggcccagat ccccacgcct 60 

gecaggagea agecgagage cagccggccg gcgcactccg actccgagca gtctctgtcc 120 

ttcgacccga gccccgcgcc etttceggga cccctgcccc gcgggcagcg ctgccaacct 180 



tttctgtatg cctggtgtgg 
gggaacaaaa acaggcagaa 
ggcatgggaa attccaccaa 
aactatttgg ttcgaataaa 
tctccgcccc cacttactga 
acatcgaata tgactttgat 
tcgag 



caaaaggaag 60 

attcatgtgt 120 

taaaaaagat 180 

tgaaataaag 240 

tactcctgac 300 

aataaatacc 360 
395 
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gccggccatg gagaccccgt cccagcggcg cgccacccgc 
cactccgctg tcgcccaccc gcatcacccg gctgcaggag 
caatgatcgc ttggcggtct acatcgaccg tgtgcgctcg 
gctgcgcctt cgcatcaccg agtctgaaga ggtggtcagc 
ggccgcctac gaggccgagc tcggggatgc ccgcaagacc 
gcgcgcccgc ctgcagctgg agctgagcaa agtgcgtgaa 
gcgcaatac 



agcggggcgc aggccagctc 
aaggaggacc tgcaggagct 
ctggaaacgg agaacgcagg 
cgcgaggtgt ccggcatcaa 
cttgactcag tagccaagga 
gagtttaagg agctgaaagc 



240 
300 
360 
420 
480 
540 
549 



<210> 168 
<211> 547 
<212> DNA 
<213> Homo sapien 

<400> 168 

gaattcggca cgagatggcg gcaggggtcg aagcggcggc ggaggtggcg gcgacggaga 60 

tcaaaatgga ggaagagagc ggcgcgcccg gcgtgccgag cggcaacggg gctccgggcc 120 

ctaagggtga aggagaacga cctgctcaga atgagaagag gaaggagaaa aacataaaaa 180 

gaggaggcaa tcgctttgag ccatatgcca atccaactaa aagatacaga gccttcatta 240 

caaacatacc ttttgatgtg aaatggcagt cacttaaaga cctggttaaa gaaaaagttg 300 

gtgaggtaac atacgtggag ctcttaatgg acgctgaagg aaagtcaagg ggatgtgctg 360 

ttgttgaatt caagatggaa gagagcatga aaaaagctgc ggaagtccta aacaagcata 420 

gtctgagcgg aagaccactg aaagtcaaag aagatcctga tggtgaacat gccaggagag 480 

caatgcaaaa ggctggaaga cttggaagca cagtatttgt agcaaatctg gattataaag 540 

ttggctg 547 

<210> 169 
<211> 547 
<212> DNA 
<213> Homo sapien 



<210> 170 
<211> 838 
<212> DNA 
<213> Homo sapien 

<400> 170 

gaattcggca ccagaggagc tcggcctgcg ctgcgccacg atgtccgggg agtcagccag 60 

gagcttgggg aagggaagcg cgcccccggg gccggtcccg gagggctcga tccgcatcta 120 

cagcatgagg ttctgcccgt ttgctgagag gacgcgtcta gtcctgaagg ccaagggaat 180 

caggcatgaa gtcatcaata tcaacctgaa aaataagcct gagtggttct ttaagaaaaa 240 

tccctttggt ctggtgccag ttctggaaaa cagtcagggt cagctgatct acgagtctgc 300 

catcacctgt gagtacctgg atgaagcata cccagggaag aagctgttgc cggatgaccc 360 

ctatgagaaa gcttgccaga agatgatctt agagttgttt tctaaggtgc catccttggt 420 

aggaagcttt attagaagcc aaaataaaga agactatgat ggcctaaaag aagaatttcg 480 

taaagaattt accaagctag aggaggttct gactaataag aagacgacct tctttggtgg 540 

caattctatc tctatgattg attacctcat ctggccctgg tttgaacggc tggaagcaat 600 



<400> 169 



gaattcggca ccaggagtcc gactgtgctc gctgctcagc 
ctcgccgtgg gagccctgct ggtctgcgcc gtcctggggc 
aaaactgtga gatggtgtgc agtgtcggag catgaggcca 
gaccatatga aaagcgtcat tccatccgat ggtcccagtg 
tcctaccttg attgcatcag ggccattgcg gcaaacgaag 
gcaggtttgg tgtatgatgc ttacctggct cccaataacc 
ttctatgggt caaaagagga tccacagact ttctattatg 
gatagtggct tccagatgaa ccagcttcga ggcaagaagt 
aggtccgctg ggtggaacat ccccataggc ttactttact 
aaacctc 



gccgcacccg gaagatgagg 
tgtgtctggc tgtccctgat 
ctaagtgcca gagtttccgc 
ttgcttgtgt gaagaaagcc 
cggatgctgt gacactggat 
tgaagcctgt ggtggcagag 
ctgttgctgt ggtgaagaag 
cctgccacac gggtctaggc 
gtgacttacc tgagccacgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
547 
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gaagttaaat gagtgtgtag accacactcc aaaactgaaa ctgtggatgg cagccatgaa 660 

ggaagatccc acagtctcag ccctgcttac tagtgagaaa gactggcaag gtttcctaga 720 

gctctactta cagaacagcc ctgaggcctg tgactatggg ctctgaaggg ggcaggagtc 780 

agcaataaag ctatgtctga tattttcctt cactaaaaaa aaaaaaaaaa aactcgag 838 

<210> 171 
<211> 547 
<212> DNA 
<213> Homo sapien 

<400> 171 
gaattcggca ccagcgggat ttgggtcgca 
cttgacaatg cagatcttcg tgaagactct 
gcccagtgac accatcgaga atgtcaaggc 
tgaccagcag aggctgatct ttgctggaaa 
ctacaacatc cagaaagagt ccaccctgca 
aatcttcgtg aagacactca ctggcaagac 
catcgagaac gtcaaagcaa agatccagga 
gttgatcttt gccggaaagc agctggaaga 
gaaagagtct accctgcacc tggtgctccg 
gaccctg 

<210> 172 
<211> 608 
<212> DNA 
<213> Homo sapien 

<400> 172 
gaattcggca ccagagactt ctccctctga 
caccctcatc tacaatggtg ccctgccatg 
tgagtgcaac cctcatggtg gtcagtgcct 
tgacctctgt gcccctggct actatggctt 
gggctgccgt gatcacacag ggggtgagca 
ggacccacgg ctgccatatg ggggccagtg 
gagccaacgg cactttgcta cttcttgcca 
ccactgccgg gcaggctata cggggctgcg 
ggacccatca aggccaggtg gccggtgcca 
aatggatcct gatgcctgtg acccccacac 
agagggtc 

<210> 173 
<211> 543 
<212> DNA 
<213> Homo sapien 

<400> 173 

gaattcggca ccagagatca tccgccagca gggtctggcc tcctacgact acgtgcgccg 60 

ccgcctcacg gctgaggacc tgttcgaggc tcggatcatc tctctcgaga cctacaacct 120 

gctccgggag ggcaccagga gcctccgtga ggctctcgag gcggagtccg cctggtgcta 180 

cctctatggc acgggctccg tggctggtgt ctacctgccc ggttccaggc agacactgag 240 

catctaccag gctctcaaga aagggctgct gagtgccgag gtggcccgcc tgctgctgga 300 

ggcacaggca gccacaggct tcctgctgga cccggtgaag ggggaacggc tgactgtgga 360 

tgaagctgtg cggaagggcc tcgtggggcc cgaactgcac gaccgcctgc tctcggctga 420 

gcgggcggtc accggctacc gtgaccccta caccgagcag accatctcgc tcttccaggc 480 

catgaagaag gaactgatcc ctactgagga ggccctgcgg ctgtggatgc ccagctggcc 540 

acc 543 

<210> 174 



gttcttgttt gtggattgct 
gactggtaag accatcaccc 
aaagatccaa gataaggaag 
acagctggaa gatgggcgca 
cctggtgctc cgtctcagag 
catcaccctt gaggtcgagc 
caaggaaggc attcctcctg 
tgggcgcacc ctgtctgact 
tctcagaggt gggatgcaga 



gtgatcgtca 60 

tcgaggttga 120 

gcatccctcc 180 

ccctgtctga 240 

gtgggatgca 300 

ccagtgacac 360 

accagcagag 420 

acaacatcca 480 

tcttcgtgaa 540 
547 



ggcctgcgca cccctcctca 
tcagtgcaac cctcaaggtt 
gtgcaagcct ggagtggttg 
tggccccaca ggctgtcaag 
ctgtgaaagg tgcattgctg 
ccggccctgt ccctgtcctg 
ccaggatgaa tattcccagc 
atgtgaagct tgtgcccctg 
actgtgtgag tgcagtggga 
ggggcaatgc ctgcgctgtt 



tcagcctgtc 60 

cactgagttc 120 

ggcgccgctg 180 

gcgcttgcct 240 

gtttccacgg 300 

aaggccctgg 360 

agattgtgtg 420 

ggcactttgg 480 

acattgaccc 540 

tacaccacac 600 
608 
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<211> 548 
<212> DNA 
<213> Homo sapien 

<400> 174 

gaattcggca cgagaaatgg cggcaggggt cgaagcggcg gcggaggtgg cggcgacgga 60 

gatcaaaatg gaggaagaga gcggcgcgcc cggcgtgccg agcggcaacg gggctccggg 120 

ccctaagggt gaaggagaac gacctgctca gaatgagaag aggaaggaga aaaacataaa 180 

aagaggaggc aatcgctttg agccatatgc caatccaact aaaagataca gagccttcat 240 

tacaaacata ccttttgatg tgaaatggca gtcacttaaa gacctggtta aagaaaaagt 300 

tggtgaggta acatacgtgg agctcttaat ggacgctgaa ggaaagtcaa ggggatgtgc 360 

tgttgttgaa ttcaagatgg aagagagcat gaaaaaagct gcggaagtcc taaacaagca 420 

tagtctgagc ggaagaccac tgaaagtcaa agaagatcct gatggtgaac atgccaggag 480 

agcaatgcaa aaggtgatgg ctacgactgg tgggatgggt atgggaccag gtggcccagg 540 

aatgatta 548 

<210> 175 
<211> 604 
<212> DNA 
<213> Homo sapien 

<400> 175 

gaattcggca ccagaggacc tccaggacat gttcatcgtc cataccatcg aggagattga 60 

gggcctgatc tcagcccatg accagttcaa gtccaccctg ccggacgccg atagggagcg 120 

cgaggccatc ctggccatcc acaaggaggc ccagaggatc gctgagagca accacatcaa 180 

gctgtcgggc agcaacccct acaccaccgt caccccgcaa atcatcaact ccaagtggga 240 

gaaggtgcag cagctggtgc caaaacggga ccatgccctc ctggaggagc agagcaagca 300 

gcagtccaac gagcacctgc gccgccagtt cgccagccag gccaatgttg tggggccctg 360 

gatccagacc aagatggagg agatcgggcg catctccatt gagatgaacg ggaccctgga 420 

ggaccagctg agccacctga agcagtatga acgcagcatc gtggactaca agcccaacct 480 

ggacctgctg gagcagcagc accagcttat ccaggaggcc ctcatcttcg acaacaagca 540 

caccaactat accatggagc acatccgcgt gggctgggag cagctgctca ccaccattgc 600 

ccgg 604 

<210> 176 
<211> 486 
<212> DNA 
<213> Homo sapien 

<400> 176 

gaattcggca ccagccaagc tcactattga atccacgccg ttcaatgtcg cagaggggaa 60 

ggaggttctt ctactcgccc acaacctgcc ccagaatcgt attggttaca gctggtacaa 120 

aggcgaaaga gtggatggca acagtctaat tgtaggatat gtaataggaa ctcaacaagc 180 

taccccaggg cccgcataca gtggtcgaga gacaatatac cccaatgcat ccctgctgat 240 

ccagaacgtc acccagaatg acacaggatt ctatacccta caagtcataa agtcagatct 300 

tgtgaatgaa gaagcaaccg gacagttcca tgtatacccg gagctgccca agccctccat 360 

ctccagcaac aactccaacc ccgtggagga caaggatgct gtggccttca cctgtgaacc 420 

tgaggttcag aacacaacct acctgtggtg ggtaaatggt cagagcctcc cggtcagtcc 480 

caaggc 486 

<210> 177 
<211> 387 
<212> DNA 
<213> Homo sapien 

<400> 177 

gaattcggca ccagggacag cagaccagac agtcacagca gccttgacaa aacgttcctg 60 
gaactcaagc tcttctccac agaggaggac agagcagaca gcagagacca tggagtctcc 120 
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ctcggcccct ccccacagat ggtgcatccc ctggcagagg ctcctgctca cagcctcact 180 

tctaaccttc tggaacccgc ccaccactgc caagctcact attgaatcca cgccgttcaa 240 

tgtcgcagag gggaaggagg tgcttctact tgtccacaat ctgccccagc atctttttgg 300 

ctacagctgg tacaaaggtg aaagagtgga tggcaaccgt caaattatag gatatgtaat 360 

aggaactcaa caagctaccc cagggcc 387 

<210> 178 

<211> 440 

<212> DNA 

<213> Homo sapien 

<400> 178 

gaattcggca cgaggagaag cagaaaaaca aggaatttag ccagacttta gaaaatgaga 60 

aaaatacctt actgagtcag atatcaacaa aggatggtga actaaaaatg cttcaggagg 120 

aagtaaccaa aatgaacctg ttaaatcagc aaatccaaga agaactctct agagttacca 180 

aactaaagga gacagcagaa gaagagaaag atgatttgga agagaggctt atgaatcaat 240 

tagcagaact taatggaagc attgggaatt actgtcagga tgttacagat gcccaaataa 300 

aaaatgagct attggaatct gaaatgaaga accttaaaaa gtgtgtgagt gaattggaag 360 

aagaaaagca gcagttagtc aaggaaaaaa ctaaggtgga atcagaaata cgaaaggaat 420 

atttggagaa aatacaaggt 440 

<210> 179 

<211> 443 

<212> DNA 

<213> Homo sapien 

<400> 179 

gaattcggca ccagcggggg gctacggcgg cggctacggc ggcgtcctga ccgcgtccga 60 

cgggctgctg gcgggcaacg agaagctaac catgcagaac ctcaacgacc gcctggcctc 120 

ctacctggac aaggtgcgcg ccctggaggc ggccaacggc gagctagagg tgaagatccg 180 

cgactggtac cagaagcagg ggcctgggcc ctcccgcgac tacagccact actacacgac 240 

catccaggac ctgcgggaca agattcttgg tgccaccatt gagaactcca ggattgtcct 300 

gcagatcgac aacgcccgtc tggctgcaga tgacttccga accaagtttg agacggaaca 360 

ggctctgcgc atgagcgtgg aggccgacat caacggcctg cgcagggtgc tggatgagct 420 

gaccctggcc aggaccgacc tgg 443 

<210> 180 
<211> 403 
<212> DNA 
<213> Homo sapien 

<400> 180 

gaattcggca cgaggttatg agagtcgact tcaatgttcc tatgaagaac aaccagataa 60 

caaacaacca gaggattaag gctgctgtcc caagcatcaa attctgcttg gacaatggag 120 

ccaagtcggt agtccttatg agccacctag gccggcctga tggtgtgccc atgcctgaca 180 

agtactcctt agagccagtt gctgtagaac tcagatctct gctgggcaag gatgttctgt 240 

tcttgaagga ctgtgtaggc ccagaagtgg agaaagcctg tgccaaccca gctgctgggt 300 

ctgtcatcct gctggagaac ctccgctttc atgtggagga agaagggaag ggaaaagatg 360 

cttctgggaa caaggttaaa gccgagccag ccaaaataga age 403 

<210> 181 
<211> 493 
<212> DNA 
<213> Homo sapien 

<400> 181 

gaattcggca ccagcagagg tctccagagc cttctctctc ctgtgcaaaa tggcaactct 60 
taaggaaaaa ctcattgcac cagttgcgga agaagaggca acagttccaa acaataagat 120 
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cactgtagtg ggtgttggac aagttggtat ggcgtgtgct atcagcattc tgggaaagtc 180 

tctggctgat gaacttgctc ttgtggatgt tttggaagat aagcttaaag gagaaatgat 240 

ggatctgcag catgggagct tatttcttca gacacctaaa attgtggcag ataaagatta 300 

ttctgtgacc gccaattcta agattgtagt ggtaactgca ggagtccgtc agcaagaagg 360 

ggagagtcgg ctcaatctgg tgcagagaaa tgttaatgtc ttcaaattca ttattcctca 420 

gatcgtcaag tacagtcctg attgcatcat aattgtggtt tccaacccag tggacattct 480 

tacgtatgtt acc 493 

<210> 182 

<211> 209 

<212> PRT 

<213> Homo sapien 



<400> 182 



































1 








5 










10 










15 Y 


ihr 








20 










25 










30 






Ala 




35 




Val 






Ala 
40 










45 


Arg 


His 




Pro 


Tyr 
50 


Ala 


Gin 


Arg 


Gin 


Phe 
55 


Leu 


Lys 


Leu 


Gly 


Gly 
60 


Leu 


Gin 


Val 


Leu 


Arg 


Thr 


Leu 


Val 


Gin 


Glu 


Lys 


Gly 


Thr 


Glu 


Val 


Leu 


Ala 


Val 


Arg 


Val 


65 










70 










75 










80 


Val 


Thr 


Leu 


Leu 


Tyr 
85 


Asp 


Leu 


Val 


Thr 


Glu 
90 


Lys 


Met 


Phe 


Ala 


Glu 
95 


Glu 


Glu 


Ala 


Glu 


Leu 


Thr 


Gin 


Glu 


Met 


Ser 


Pro 


Glu 


Lys 


Leu 


Gin 


Gin 


Tyr 








100 










105 










110 




Arg 


Gin 


Val 
115 


His 


Leu 


Leu 


Pro 


Gly 
120 


Leu 


Trp 


Glu 


Gin 


Gly 
125 


Trp 


Cys 


Glu 


He 


Thr 
130 


Ala 


His 


Leu 


Leu 


Ala 
135 


Leu 


Pro 


Glu 


His 


Asp 
140 


Ala 


Arg 


Glu 


Lys 


Val 


Leu 


Gin 


Thr 


Leu 


Gly 


Val 


Leu 


Leu 


Thr 


Thr 


Cys 


Arg 


Asp 


Arg 


Tyr 


145 










150 










155 










160 


Arg 


Gin 


Asp 


Pro 


Gin 
165 


Leu 


Gly 


Arg 


Thr 


Leu 
170 


Ala 


Ser 


Leu 


Gin 


Ala 
175 


Glu 


Tyr 


Gin 


Val 


Leu 
180 


Ala 


Ser 


Leu 


Glu 


Leu 
185 


Gin 


Asp 


Gly 


Glu 


Asp 
190 


Glu 


Gly 


Tyr 


Phe 


Gin 
195 


Glu 


Leu 


Leu 


Gly 


Ser 
200 


Val 


Asn 


Ser 


Leu 


Leu 
205 


Lys 


Glu 


Leu 



Arg 



<210> 183 
<211> 255 
<212> PRT 
<213> Homo sapien 



<400> 183 



Met 


Ala Ala 


Gly Val Glu 


Ala 


Ala 


Ala 


Glu 


Val 


Ala 


Ala 


Thr 


Glu 


Pro 


1 




5 








10 










15 




Lys 


Met Glu 


Glu Glu Ser 


Gly 


Ala 


Pro 


Cys 


Val 


Pro 


Ser 


Gly 


Asn 


Gly 






20 






25 










30 






Ala 


Pro Gly 


Pro Lys Gly 


Glu 


Glu 


Arg 


Pro 


Thr 


Gin 


Asn 


Glu 


Lys 


Arg 




35 






40 










45 








Lys 


Glu Lys 


Asn He Lys 


Arg 


Gly 


Gly 


Asn 


Arg 


Phe 


Glu 


Pro 


Tyr 


Ser 




50 




55 










60 










Asn 


Pro Thr 


Lys Arg Tyr 


Arg 


Ala 


Phe 


He 


Thr 


Asn 


He 


Pro 


Phe 


Asp 
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65 










70 




75 


80 


Val 


Lys 


Trp 


Gin 


Ser 




Lys Asp Leu Val 


Lys Glu Lys 


Val Gly Glu 










85 




90 




95 


Val 


Thr 


Tyr 


Val 


Glu 


Leu 


Leu Met Asp Ala 


Glu Gly Lys 


Ser Arg Gly 








100 






105 




110 


Cys 


Ala 


Val 


Val 


Glu 


Phe 


Lys Met Glu Glu 


Ser Met Lys 


Lys Ala Ala 






115 








120 


125 




Glu 


Val 




Asn 


Lys 


His 


Ser Leu Ser Gly 


Arg Pro Leu 


Lys Val Lys 




130 










135 


140 




Glu 


Asp 


Pro 


Asd 


Gly 


Glu 


His Ala Arg Arg 


Ala Met Gin 


Lys Ala Gly 


145 










150 




155 


160 


Arg 


Leu 


Gly 


Ser 


Thr 


Val 


Phe Val Ala Asn 


Leu Asp Tyr 


Lys Val Gly 










165 




170 




175 


Trp 


Lys 


Lys 


Leu 


Lys 


Glu 


Val Phe Ser Met 


Ala Gly Val 


Val Val Arg 








180 






185 




190 


Ala 


Asp 


lie 


Leu 


Glu 


Asp 


Lys Asp Gly Lys 


Ser Arg Gly 


He Gly He 






195 








200 


205 




Val 


Thr 


Phe 


Glu 


Gin 


Ser 


He Glu Ala Val 


Gin Ala He 


Ser Met Phe 




210 










215 


220 




Asn 


Gly 


Gin 


Leu 


Leu 


Phe 


Asp Arg Pro Met 


His Val Lys 


Met Asp Glu 


225 










230 




235 


240 


Arg 


Ala 


Leu 


Pro 


Lys 


Gly 


Asp Phe Phe Pro 


Pro Glu Arg 


His Ser 










245 




250 




255 




<210> 


184 














<211> 


188 














<212> 


PRT 














<213> 


Homo sapien 










<400> 


184 












Leu 


Ser 


Gly 


Ser 


Cys 


He 


Arg Arg Glu Gin 


Thr Pro Glu 


Lys Glu Lys 


1 








5 




10 




15 


Gin 


Val 


Val 


Leu 


Phe 


Glu 


Glu Ala Ser Trp 


Thr Cys Thr 










20 






25 




30 


Gly 


Asp 


Glu 


Pro 


Arg 


Thr 


Val He Leu Leu 


Ser Ser Met 


Leu Ala Asp 






35 








40 


45 


His 


Arg 


Leu 


Lys 


Leu 


Glu 


Asp Tyr Lys Asp 


Arg Leu Lys 


Ser Gly Glu 




50 










55 


60 




His 


Leu 


Asn 


Pro 


Asp 


Gin 


Leu Glu Ala Val 


Glu Lys Tyr 


Glu Glu Val 


65 










70 




75 


80 


Leu 


His 


Asn 


Leu 


Glu 


Phe 


Ala Lys Glu Leu 


Gin Lys Thr 


Phe Seir Gly 










85 




90 




95 


Leu 


Ser 


Leu 


Asp 


Leu 


Leu 


Lys Ala Gin Lys 


Lys Ala Gin 


Aurcy Aircj Glu 








100 






105 




110 


His 


Met 


Leu 


Lys 


Leu 


Glu 


Ala Glu Lys Lys 


Lys Leu Arg 


Thr* lie Leu 






115 








120 


125 




Gin 


Val 


Gin 


Tyr 


Val 


Leu 


Gin Asn Leu Thr 


Gin Glu His 


Val Gin Lys 




130 










135 


140 


Asp 


Phe 


Lys 


Gly 


Gly 


Leu 


Asn Gly Ala Val 






145 










150 






Ser Lys G^u 


Leu 


Asp 


Tyr 


Leu 


He 


Lys 


Phe Ser Lys Leu 


Thr Cys Pro 


Glu Arg Asn 










165 




170 




175 


Glu 


Ser 


Leu 


Arg 


Gin 


Thr 


Leu Glu Gly Ser 


Thr Val 










180 






185 








<210> 


185 














<211> 


746 














<212> 


PRT 
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<213> Homo sapien 



<400> 185 



Asp 


Lys 


His 


Leu 


Lys 


Asp 


Leu 


Leu 


Ser 


Lys Leu 


Leu Asn Ser Gly Tyr 


1 








5 










10 


15 


Phe 


Glu 


Ser 


He 




Val 


Pro 


Lvs 


Asn 


Ala Lys 


Glu Lys Glu Val Pro 








20 
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30 


Leu 


Glu 


Glu 


Glu 


Met 


Leu 


He 


Gin 


Ser 


Glu Lys 


Lys Thr Gin Leu Ser 






35 










40 






45 


Lys 


Thr 


Glu 


Ser 


Val 


Lys 


Glu 




Glu 


Ser Leu 


Met Glu Phe Ala Gin 




50 










55 








60 


Pro 


Glu 


lie 


Gin 


Pro 


Gin 


Glu 


Phe 


Leu 


Asn Arg 


Arg Tyr Met Thr Glu 
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Val 


Asp 


Tvr 


Ser 


Asn 


Lys 


Gin 
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Glu 


Glu Gin 


Pro Trp Glu Ala Asp 










85 
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Tyr 


Ala 


Arg 


Lys 


Pro 


Asn 


Leu 


Pro 


Lys 


Arg Trp 


Asp Met Leu Thr Glu 
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Pro 


Asp 


Gly 


Gin 


Glu 


Lys 


Lys 


Gin 


Glu 


Ser Phe 


Lys Ser Trp Glu Ala 
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120 
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Ser 


Gly 


Lys 


His 


Gin 


Glu 


Val 


Ser 


Lys 


Pro Ala 


Val Ser Leu Glu Gin 
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135 








140 


Arg 


Lys 


Gin 


Asp 


Thr 


Ser 


Lys 


Leu 


Arg 


Ser Thr 


Leu Pro Glu Glu Gin 


145 










150 








155 


160 


Lys 


Lvs 


Gin 


Glu 


He 


Ser 


Lvs 


Ser 


Lvs 




Pro Ser Gin Trp Lys 










165 










170 




Gin 


Asp 


Thr 


Pro 


Lys 


Ser 


ys 


Ala 


Gly 


Tyr Val 


Gin Glu Glu Gin Lys 








180 










185 




190 


Lvs 


Gin 


Glu 




Pro 


L 


Leu 


Trp 




Val Gin 








195 










200 






205 


Asp 


Pro 


Lys 


Lvs 


Gin 


Thr 


Pro 


Lys 


Ser 


Trp Thr 


Pro Ser Met Gin Ser 




210 










215 








220 


Glu 


Gin 


Asn 


Thr 


Thr 


Lys 


Ser 


Trp 


Thr 


Thr Pro 


Met Cys Glu Glu Gin 


225 










230 








235 


240 


Asp 


Ser 


Lvs 


Gin 


Pro 


Glu 


Thr 


Pro 


Lvs 




Glu Asn Asn Val Glu 










245 










250 


255 


Ser 


Gin 


Lvs 


His 




Leu 


Thr 


Ser 


Gin 


Ser Gin 


He Ser Pro Lys Ser 








260 










265 




270 


Trp 


Gly 


Val 


Ala 


Thr 


Ala 


Ser 


Leu 


He 


Pro Asn 


Asp Gin Leu Leu Pro 






275 










280 






285 


Arcr 


Lys 




Asn 


Thr 


Glu 


Pro 


Lys 


Asp 


Val Pro 


Lys Pro Val His Gin 




290 










295 








300 


Pro 


Val 


Gly 


Ser 


Ser 


Ser 




Leu 


Pro 


Lys Asp 


Pro Val Leu Arg Lys 


305 










310 








315 


320 


Glu 


Lys 


Leu 


Gin 


Asp 




Met 


Thr 


Gin 


He Gin 


Gly Thr Cys Asn Phe 










325 










330 


335 


Met 


Gin 


Glu 


Ser 


Val 


Leu 


Asp 


Phe 


Asp 


Lys Pro 


Ser Ser Ala Xle Pro 








340 










345 






Thr 


Ser 


Gin 


Pro 


Pro 


Ser 


Ala 


Thr 


Pro 


Gly Ser 


Pro Val Ala Ser Lys 






355 










360 






365 


Glu 


Gin 


Asn 


Leu 


Ser 


Ser 


Gin 


Ser 


Asp 


Phe Leu 


Gin Glu Pro Leu Gin 




370 










375 








380 


Val 


Phe 


Asn 


Val 


Asn 


Ala 


Pro 


Leu 


Pro 


Pro Arg 


Lys Glu Gin Glu He 


385 










390 








395 


400 


Lys 


Glu 


Ser 


Pro 


Tyr 


Ser 


Pro 


Gly 


Tyr 


Asn Gin 


Ser Phe Thr Thr Ala 










405 










410 


415 


Ser 


Thr 


Gin 


Thr 


Pro 


Pro 


Gin 


Cys 


Gin 


Leu Pro 


Ser He His Val Glu 








420 










425 




430 


Gin 


Thr 


Val 


His 


Ser 


Gin 


Glu 


Thr 


Ala 


Ala Asn 


Tyr His Pro Asp Gly 
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435 






440 




445 






Thr lie 


Gin 


Val 


Ser 


Asn Gly Ser Leu 


Ala 


Phe Tyr Pro 


Ala Gin 


Thr 


450 








455 




460 






Asn Val 


Phe 


Pro 


Arg 


Pro Thr Gin Pro 


Phe 


Val Asn Ser 


A Gl 
rg y 


Ser 


465 








470 




475 




480 


Val Arg 


Glv 


Cys 


Thr 


Arg Gly Gly Arg 


Leu 


lie Thr Asn 


Ser Tyr 


Arg 








485 




490 




495 




Ser Pro 


Glv 


Gly Tyr 


Lys Gly Phe Asp 




Tyr Arg Gly 


Leu Pro 


Ser 
er 






500 




505 










lie Ser 


Asn 


Gly 


Asn 


Tyr Ser Gin Leu 


Gin 


Phe Gin Ala 


Ar Glu 
rg u 


T 

yr 




515 






520 




525 






Ser Gly 


Ala 


Pro 


Tyr 


Ser Gin Arg Asp 


Asn 


Phe Gin Gin 


Cvs Tvr 

y y 


L 


530 








535 












Gly 


Thr 


Ser 


Gly Gly Pro Arg 


Ala 


Asn Ser Arg 


Ala Gl 
a y 




545 








550 




555 




560 


Ser Asp 


Ser 


Ser 


Gin 


Val Ser Ser Pro 


Glu 


Arg Asp Asn 


Glu Thr 


Phe 








565 




570 




575 




Asn Ser 


Gly 


Asp 


Ser 


Gly Gin Gly Asp 




Arg Ser Met 




Val 










585 






590 




Asp Val 


Pro 


Val 


Thr 


Asn Pro Ala Ala 


Thr 


He Leu Pro 


Val His 


Val 




595 






600 




605 






Tyr Pro 




Pro 


Gin 


Gin Met Arg Val 


Ala 


Phe Ser Ala 


Ala Arq 


Thr 
r 


610 








615 




620 








Leu 


Ala 


Pro 


Gly Thr Leu Asp 


Gin 


Pro He Val 


Phe As 
e sp 


Leu 


625 








630 




635 




fi40 




Asn 


Asn 


Leu 


Gly Glu Thr Phe 


Asp 


Leu Gin Leu 


Gl A 


Phe 








645 




650 




Y 655 




Asn Cys 


Pro 


Val 


Asn 


Gly Thr Tyr Val 


Phe 


He Phe His 


Met Leu 


ys 






660 




665 






670 




Leu Ala 


Val 


Asn 


Val 




Asn 
sn 


Leu Met Lys 


Asn Glu 


Glu 




675 






680 




685 






Val Leu 


Val 


Ser 


Ala 


Tyr Ala Asn Asp 


Gly 


Ala Pro Asp 


His Glu 


Thr 


690 








695 




700 






Ala Ser 


Asn 


His 


Ala 


lie Leu Gin Leu 


Phe 


Gin Gly Asp 


Gin He 


Trp 


705 








710 




715 




720 


Leu Arg 


Leu 


His Arg 


Gly Ala He Tyr 


Gly 


Ser Ser Trp 


Lys Tyr 


Ser 








725 




730 




735 




Thr Phe 


Ser 


Gly Tyr 


Leu Leu Tyr Gin 


Asp 












740 




745 











<210> 186 
<211> 705 
<212> PRT 
<213> Homo sapien 





<400> 


186 
























Ala 


Leu 


Leu 


Asn 


Val 


Arg 


Gin Pro 


Pro 


Ser 


Thr 


Thr 


Thr 


Phe Val 


Leu 


1 








5 








10 










15 




Asn 


Gin 


He 


Asn 


His 


Leu 


Pro Pro 


Leu 


Gly 


Ser 


Thr 


He 


Val 


Met 


Thr 








20 








25 








30 






Lys 


Thr 


Pro 


Pro 


Val 


Thr 


Thr Asn 


Arg 


Gin 


Thr 


He 


Thr 


Leu 


Thr 


Lys 






35 








40 










45 








Phe 


He 


Gin 


Thr 


Thr 


Ala Ser Thr Arg 


Pro 


Ser 


Val 


Ser 


Ala 


Pro 


Thr 




50 










55 








60 










Val 


Arg 


Asn 


Ala 


Met 


Thr 


Ser Ala 


Pro 


Ser 


Lys Asp Gin 


Val 


Gin 


Leu 


65 










70 








75 










80 


Lys Asp 


Leu 


Leu 


Lys 


Asn 


Asn Ser 


Leu 


Asn 


Glu 


Leu 


Met 


Lys 


Leu 


Lys 



85 90 95 
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Pro 


Pro 


Ala 




lie 


Ala 






Val 


Ala Thr Ala Ala Thr Asp Val 








100 










105 




Ser 


Asn 


Gly 




Val 


ys 


ys 


Glu 


Ser 


Ser Asn L s Glu Gi° Al A 






115 










] 20 




Y Y a rg 


Met 


Trp 


lie 


Asn 


sp 


Met 




Met 


rg 


Ser Phe Ser Pr Th M t L 




130 










135 






S 140 r ° r 6 ^ 


Val 




Val 


Val 


ys 


Gl 


sp 


sp 


Gl 

U 


Pro Glu Glu Glu Asp Glu Glu 


145 










150 










Glu 


Met 


Gl 


His 


Ala 


Glu 
U 


Thr 
r 


Tyr 


Ala 


Glu T^r Met Pro 11 L L^ 










165 










170 Y ro e ^ys eu 


L 


lie 


Gl 


Leu 


Arg 


His 


Pro 
ro 


sp 


Ala 


Val Val Glu Thr Ser Ser Leu 








180 










185 


190 


Ser 


Ser 


Val 




Pro 


Pro 
ro 


sp 


Val 


Trp 


Tyr Lys Thr Ser He Ser Glu 






195 










200 






Glu 


Thr 


lie 


Asd 


Asn 


Gl 

Y 


Trp 




Ser 


Ala Leu Gin Leu Glu Ala He 




210 










215 






220 


Thr 


Tyr 


Ala 


Ala 


Gin 


Gin 


His 


Glu 


Thr 




225 










230 








235 240 


Ala 


Gly 


Phe 


Leu 


lie 


Gly 


sp 


Gl 

Y 


Ala 


Gly Val Gly Lys Gly Arg Thr 










245 










250 255 


lie 


Ala 


Gl 


lie 


lie 


yr 


Glu 
U 


Asn 


iyr 


Leu Leu Ser Arg Lys Arg Ala 








260 














Leu 


Trp 


Phe 




Val 


Ser 


Asn 


Asp 


Leu 


Lys Tyr Asp Ala Glu Arg Asp 






275 










280 




285 


Leu 


Arg 


Asp 


lie 


- 

Y 


Ala 


Lys 


Asn 


6 


Leu Val His Ser Leu Asn Lys 




290 










295 








Phe 


Lys 


yr 


n 

Y 


ys 


lie 


Ser 


Ser 
er 


ys 


His Asn Gly Ser Val Lys Lys 


305 




















Gly 


Val 


lie 


Phe 


Ala 
a 


Thr 
r 


Tyr 


Ser 
er 


r 
er 


Leu He Gly Glu Ser Gin Ser 






















Gl 


Gl 


ys 


ryr 


ys 


Thr 


Arg 


Leu 
eu 


Lys 


Gin Leu Leu His Trp Cys Gly 








340 














sp 


sp 


Phe 


sp 


y 


Val 


lie 


Val 


6 


Asp Glu Cys His Lys Ala Lys 






355 










360 




365 


Asn 


Leu 


Cys 


Pro 
ro 


_ 

a 


Y 


Ser 


Ser 


ys 


Pro Thr Lys Thr Gly Leu Ala 




370 










37R 








Val 




Glu 


Leu 
eu 


Gin 
n 


Asn 


ys 


Leu 
eu 


Pro 
ro 


Lys Ala Arg Val Val Tyr Ala 


385 










390 












Ala 


Thr 


Gl 

Y 


Ala 


Ser 


Glu 


Pro 


rg 


Asn Met Ala T r Met Asn Ar° 










405 










410 ^ S 415 rg 


Leu 


Gly 


lie 


Trp 


Gly 


Glu 


Gl 

Y 


Thr 


Pro 


Phe Arg Glu Phe Ser Asp Phe 








420 










425 


430 


lie 


Gin 


Ala 




Glu 
U 


rg 


rg 


n 


Val 


Gly Ala Met Glu He Val Ala 






435 










440 






Met 


Asp 




ys 


Leu 
eu 


rg 


Gl 


Met 


lyr 


11 Al A Gl 5 




450 










455 






6 a 460 n eu er e 


Thr 


Gly 


Val 


Thr 


Ph 




lie 


Gl 

U 


Gl 




465 










470 








3 475 SU ^ n ^ 480 


Val 


ys 


Met 


T 

yr 


A n 


ys 


Ala 
a 


V 

a 


ys 


Leu Trp Val lie Ala Arg Glu 










485 










490 495 


Arg 


Phe 


Gin 


Gin 


Ala 


Ala 


Asp 


Leu 


lie 


Asp Ala Glu Gin Arg Met Lys 








500 










505 


510 


Lys 


Ser 


Met 


Trp 


Gly 


Gin 


Phe 


Trp 


Ser 


Ala His Gin Arg Phe Phe Lys 






515 










520 




525 


Tyr 


Leu 


Cys 


lie 


Ala 


Ser 


Lys 


Val 


Lys 


Arg Val Val Gin Leu Ala Arg 




530 










535 






540 


Glu 


Glu 


lie 


Lys 


Asn 


Gly 


Lys 


Cys 


Val 


Val He Gly Leu Gin Ser Thr 


545 










550 








555 560 



WO 02/092001 PCT7US02/14975 



Gly 






Arg 








Ala Leu 
















565 








570 


575 




Asn 


sp 


Phe 


Val 


Ser 


Thr 


Ala 


Lys Gly 




lie 


Glu 








580 








585 


590 






ys 


His 


Phe 


Pro 


Ala 


Pro 


sp 




Lys Leu Tyr Ser Leu 


Leu 


Gly 






595 










600 








lie 


Asp 


Leu 


Thr 


Ala 


Pro 


Ser 




Ser Ser Pro Arg Asp 


Ser 


Pro 




610 










615 




620 






Cys 


Lys 


Glu 


Asn 


ys 


lie 


Lys 


T 

ys rg 


Lys Gly Glu Glu lie 


Thr 


Arg 


625 










630 






635 




640 


Glu 


Ala 


Lys 


Lys 


Ala 


Arg 


Lys 


Val Gly 


Gly Leu Thr Gly Ser 


Ser 


Ser 










645 








650 


655 




Asp 


Asp 


Ser 


Gly 


Ser 


Glu 


Ser 


Asp Ala 


Ser Asp Asn Glu Glu 


Ser Asp 








660 








665 


670 






Tyr 


Glu 


Ser 


Ser 


Lys 


Asn 


Met 


Ser Ser 


Gly Asp Asp Asp Asp 


Phe 


Asn 






675 










680 


685 






Pro 


Phe 


Leu 


Asp 


Glu 


Ser 


Asn 


Glu Asp 


Asp Glu Asn Asp Pro 


Trp Leu 



690 695 700 



lie 
705 

<210> 187 
<211> 595 
<212> PRT 
<213> Homo sapien 



<400> 187 



Glu 


Ser 


Pro 


Arg 


His 


Arg 


Gly 


Glu Gly 


Gly Gly Glu Trp Gly 


Pro Gly 


1 








5 








10 


15 


Val 


Pro 


Arg 


Glu 


Arg 


Arg 


Glu 


Ser Ala 


Gly Glu Trp Gly Ala 


Asp Thr 








20 








25 


30 




Pro 


Lys 


Glu 


Gly 


Gly 


Glu 


Ser 


Ala Gly 


Glu Trp Gly Ala Glu 


Val Pro 






35 










40 


45 




Arg 


Gly 


Arg 


Gly 


Glu 


Gly 


Ala 


Gly Glu 


Trp Gly Pro Asp Thr 


Pro Lys 




50 










55 




60 




Glu 


Arg 


Gly 


Gin 


Gly 


Val 


Arg 


Glu Trp 


Gly Pro Glu lie Pro 


Gin Glu 


65 










70 






75 


80 


His 


Gly 


Glu 


Ala 


Thr 


Arg 


Asp 


Trp Ala 


Leu Glu Ser Pro Arg 


Ala Leu 










85 








90 


95 


Gly 


Glu 


Asp 


Ala 


Arg 


Glu 


Leu 


Gly Ser 


Ser Pro His Asp Arg 


Gly Ala 








100 








105 


110 




Ser 


Pro 


Arg 


Asp 


Leu 


Ser 


Gly 


Glu Ser 


Pro Cys Thr Gin Arg 


Ser Gly 






115 










120 


125 




Leu 


Leu 


Pro 


Glu 


Arg 


Arg 


Gly 


Asp Ser 


Pro Trp Pro Pro Trp 


Pro Ser 




130 










135 




140 




Pro 


Gin 


Glu 


Arg 


Asp 


Ala 


Gly 


Thr Arg 


Asp Arg Glu Glu Ser 


Pro Arg 


145 










150 






155 


160 


Asp 


Trp 


Gly 


Gly 


Ala 


Glu 


Ser 


Pro Arg 


Gly Trp Glu Ala Gly 


Pro Arg 










165 








170 


175 


Glu 


Trp 


Gly 


Pro 


Ser 


Pro 


Ser 


Gly His 


Gly Asp Gly Pro Arg 


Arg Arg 








180 








185 


190 




Pro 


Arg 


Lys 


Arg 


Arg 


Gly 


Arg 


Lys Gly 


Arg Met Gly Arg Gin 


His Glu 






195 










200 


205 




Ala 


Ala 


Ala 


Thr 


Ala 


Ala 


Thr 


Ala Ala 


Thr Ala Thr Gly Gly 


Thr Ala 




210 










215 




220 




Glu 


Glu 


Ala 


Gly 


Ala 


Ser 


Ala 


Pro Glu 


Ser Gin Ala Gly Gly 


Gly Pro 


225 










230 






235 


240 


Arg 


Gly 


Arg 


Ala 


Arg 


Gly 


Pro 


Arg Gin 


Gin Gly Arg Arg Arg 


His Gly 
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245 




250 


255 


Thr 


Gin 


Arg 


Arg 


Arg 


Gly Pro Pro Gin 


Ala Arg 


Glu Glu Gly Pro Arg 








260 




265 




270 


Asp 


Ala 


Thr 


Thr 


lie 


Leu Gly Leu Gly 


Thr Pro 


Ser Gly Glu Gin Arg 






275 






280 




285 


Ala 


Asp 


Gin 


Ser 


Gin 


Ala Leu Pro Ala 


Leu Ala 


Gly Ala Ala Ala Ala 




290 








295 




300 


His 


Ala 


His 


Ala 


lie 


Pro Gly Ala Gly 


Pro Ala 


Ala Ala Pro Val Gly 


305 










310 


315 


320 


Gly 


Arg 


Gly 


Arg 


Arg 


Gly Gly Trp Arg 


Gly Gly 


Arg Arg Gly Gly Ser 










325 




330 


335 


Ala 


Gly 


Ala 


Gly 


Gly 


Gly Gly Arg Gly 


Gly Arg 


Gly Arg Gly Arg Gly 








340 




345 




350 


Gly 


Gly 


Arg 


Gly 


Gly 


Gly Gly Ala Gly 


Arq Glv 


Gly Gly Ala Ala Gly 






355 






360 




365 


Pro 


Arg 


Glu 


Gly 


Ala 


Ser Ser Pro Gly 


Ala Arq 


Arg Gly Glu Gin Arg 




370 








375 




380 


Arg 




Gly 


Arg 


Gly 


Pro Pro Ala Ala 


Gly Ala 


Ala Gin Val Ser Ala 


385 










390 


395 


400 


Arg 


Gly 


Arg 


Arg 


Ala 


Arg Gly Gin Arg 


Ala Gly 


Glu Glu Ala Gin Asp 










405 




410 


415 


Gly 


Leu 


Leu 


Pro 


Arg 


Gly Arg Asp Arg 












420 




425 




430 ^ P 


Ala 


Asn 


Gin 


Arg 


Ala 


Glu Arg Pro Gly 


Pro Pro 


Arg Gly Gly His Gly 






435 






440 




445 


Pro 


Val 




Ala 


Ser 


Ser Ala Pro Asp 


Thr Ser 


Pro Pro Arg His Pro 




450 








455 




460 


Arg 


Arg 


Trp 


Val 


Ser 


Gin Gin Arg Gin 


Arg Leu 


Trp Arg Gin Phe Arg 


465 










470 


475 




Val 


Gly 


Gly 


Gly 


Phe 


Pro Pro Pro Pro 




Arg Pro Pro Ala Val 










485 




490 


495 


Leu 


Leu 


Pro 


Leu 




Arg Leu Ala Cys 


Ala Gly 










500 




505 




510 


Arg 


Pro 


Gly 


Pro 


Arg 


Arg Pro Ala Arg 


Arq Pro 


Arg Gly Glu Leu He 






515 






520 




525 


Pro 


Arg 


Arg 


Pro 


Asp 


Pro Ala Ala Pro 


Ser Glu 


Glu Gly Leu Arg Met 




530 








535 






Glu 


Ser 


Ser 


Val 


Asp 


Asp Gly Ala Thr 


Ala Thr 


Thr Ala Asp Ala Ala 


545 










550 


555 


560 




Gly 


Glu 


Ala 


Pro 


Glu Ala Gly Pro 




Ssr* His Ssr 1 Pro Thr 










565 








Met 


Cys 


Gin 


Thr 




Gly Pro Gly Pro 


Pro Pro 


Pro GXn Pro Pro Arg - 








580 




585 




590 


Trp 


Leu 


Pro 
















595 














<210> 


188 












<211> 


376 












<212> 


PRT 












<213> 


Homo sapien 








<400> 


188 










Glu 


Met Arg 


Lys Phe Asp Val Pro Ser 


Met Glu 


Ser Thr Leu Asn Gin 


1 








5 




10 


15 


Pro 


Ala 


Met 


Leu 


Glu Thr Leu Tyr Ser 


Asp Pro 


His Tyr Arg Ala His 








20 




25 




30 


Phe 


Pro 


Asn 


Pro 


Arg Pro Asp Thr Asn 


Lys Asp 


Val Tyr Lys Val Leu 






35 






40 




45 
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Pro 


Glu 




Lys 








Gly 




Gly Ala Val 


Phe Glu Arg Asn 




50 










55 






60 




Gly 


Pro 


His 


Ala 


Ser 


Ser 


Ser 


Gly 


Val 


Leu Pro Leu 


Glv Le Gl 


65 










70 










80° 


Ala 


Pro 


Glv 


Leu 


Ser 




Ser 


Leu 


Ser 


Ser Gin Val 


Trp Gin Pro Ser 










85 










90 


11 95° ^ 


Pro 


sp 


Pro 


Trp 




Pro 


Glv 


Glu 


Gin 


Ser Cys Glu 


Leu Ser Thr Cys 








100 










105 




110 


Ar 


Gin 


Gin 




Glu 


Leu 


He 


Arg 




Gin Met Glu 


Gin Met Gin Leu 






115 










120 






125 


Gin 


Asn 


Gly 


Ala 


Met 


Cvs 


His 


His 


Pro 


Ala Ala Phe 


Ala Pro Leu Leu 




130 










135 






140 




Pro 




Leu 


Glu 


Pro 


Ala 


Gin 


Trp 


Leu 


Ser Xle Leu 


Asn Ser Asn Glu 


145 










150 








155 




His 


Leu 


Leu 


L 


Glu 


Lys 


Glu 


Leu 


Leu 


He Asp Lys 


Gin Arg Lys His 










165 










170 


175 


He 


Ser 


Gin 


Leu 


Glu 


Gin 


L 


Val 


Arg 


Glu Ser Glu 


Leu Gin Val His 








180 










185 




190 


Ser 


Ala 


Leu 




Gl 


A 


Pro 


Ala 




Phe Gly Asp 








195 










200 






205 


Arq 


Leu 


Gin 


Glu 


Leu 


Gin 


Arg 


Glu 


Asn 


Thr Phe Leu 


Arg Ala Gin Phe 




210 










215 






220 




Ala 




y 


Thr 


Glu 


Ala 




Ser 


ys 


Glu Lys Met 


Glu Leu Glu Lys 


225 










230 








235 


240 


Lys 


Leu 


Ser 


Ala 


Ser 




Val 


Glu 


He 


Gin Leu He 












245 










250 


255 


L 


Val 


Thr 


Leu 




L 


His 


Ser 


Glu 


Glu Gly Lys 


Lys Gin Glu Glu 








260 










265 




L ?0 


rg 


Val 


Lys 


Gly 


rg 


sp 


ys 


His 




Asn Asn Leu 








275 










280 






285 yS ^ ^ 


Gin 


Lys 




Ser 


Glu 


Gin 


Asn 


A 


Glu 


Lys Gin Gin 


Arg lie Glu Thr 




290 










295 






300 




Leu 


Glu 


A 


Tvr 


Leu 


Ala 


Asp 


Leu 


Pro 


Thr Leu Glu 


Asp His Gin Lys 


305 










310 








315 


320 


Gin 


Thr 


Glu 


Gin 


Leu 


Lys 


Asp 


Ala 


Glu 


Leu Lys Asn 


Thr Glu Leu Gin 










325 










330 


335 


Glu 


Arg 


Val 


Ala 


Glu 


Leu 


Glu 
U 


Thr 


Leu 


Leu Glu Asp 


Thr Gin Ala Thr 








340 










345 






Cys 
ys 


Arg 


Glu 




Glu 


Val 


Gin 


Leu 




Ser Leu Ar 
rg 


Gin Arg Glu Ala 






355 










360 






365 


Asp 


Leu 


Ser 


Ser 


Ala 


Arg 


His 


Arg 










370 










375 












<210> 


189 


















<211> 


160 


















<212> 


PRT 


















<213> 


Homo sapien 














<400> 


189 
















Met 


Leu 


Glu 


Ala His Arg 


A 

rg 


Gin 


rg 


His Pro Phe 


Leu Leu Leu Gly 


1 








5 










10 


15 


Thr 


Thr Ala 


Asn Arg Thr 


Gin 


Ser 


Leu 


Asn Tyr Gly 


Cys He Val Glu 








20 










25 




30 


Asn 


Pro 


Gin 


Thr His 


Glu 


Val 


Leu 


His 


Tyr Val Glu 


Lys Pro Ser Thr 






35 










40 






45 


Phe 


He 


Ser 


Asp He He 


Asn 


Cys 


Gly 


He Tyr Leu 


Phe Ser Pro Glu 




50 










55 






60 




Ala 


Leu Lys 


Pro Leu Arg 


Asp 


Val 


Phe 


Gin Arg Asn 


Gin Gin Asp Gly 
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65 








70 




75 


80 


Gin 


Leu 


Glu 


Aso 


Ser Pro Gly 


Leu Trp Pro Gly 


Ala Gly Thr He Arg 










85 




90 


95 


Leu 


Glu 


Gin 


Asp 


Val Phe 


Ser 


Ala Leu Ala Gly 


Gin Gly Gin He Tyr 








100 






105 


110 


Val 


His 


Leu 


Thr 


Asp Gly He 


Trp Ser Gin He 


125 






115 








120 




Ala 


Leu Tyr 


Ala 


Ser Arg Leu 


Tyr Leu Ser Arg 


Tyr Gin Asp Thr His 




130 








135 




140 


Pro 


Glu Arg 


Leu 


Ala Lys 


His 


Thr Pro Gly Gly 


Pro Trp He Arg Gly 


145 








150 




155 


160 




<210> 


190 












<211> 


146 












<212> 


PRT 












<213> 


Homo sapien 
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Asp 


Pro 


Arg 


Ala Ser 


Leu 


Leu Leu Leu Gly 


Asn Val Tyr He His 


1 
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10 


15 


Pro 


Thr 


Ala 


Lys 


Val Ala 


Pro 


Ser Ala Val Leu 


Gly Pro Asn Val Ser 








20 






25 


30 


He 


Gly 


Lys 


Gly 


Val Thr 


Val 


Gly Glu Gly Val 


Arg Leu Arg Glu Ser 






35 








40 


45 


He 


Val 


Leu 


His 


Gly Ala 


Thr 




Thr Cys Val Leu His 




50 








55 




60 


Ser 


He 


Val 


Gly 


Trp Gly Ser 


Thr Val Gly Arg 


Trp Ala Arg Val Glu 


65 








70 




75 


80 


Gly 


Thr 


Pro 


Ser 


Asp Pro 


Asn 


Pro Asn Asp Pro 


Arg Ala Arg Met Asp 










85 








Ser 


Glu 


Ser 


Leu 


Phe Lys Asp 


Gly Lys Leu Leu 


Pro Ala He Thr He 








100 






105 


110 




Gly 


Cys 


Arg 


Val Arg He 


Pro Ala Glu Val 








115 










125 


He 


Val 


Leu 


Pro 


His Lys 


Glu 


Leu Ser Ar Ser 
rg 


Phe Thr Asn Gin He 




130 








135 




140 


He 


Leu 














145 


















<210> 


191 












<211> 


704 












<212> 


PRT 












<213> 


Homo sapien 










<400> 


191 










Glu 


Gly 


Gly 


Cys 


Ala Ala Gly 


Arg Gly Arg Glu 


Leu Glu Pro Glu Leu 


1 








5 




10 


15 




Pro 


Gly 


Pro 


Gly Pro Gly 




Pro Gly Glu Glu Phe 
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25 


30 


Glu 


He 


Val 


Asp 


Arg Ser 


Gin 










35 








40 


7 45 ? 9 


Ala 


Thr 


Arg 


Pro 


Arg Ala Ala 


Glu Gly Trp Ser 


Ala Pro He Leu Thr 




50 








55 




60 


Leu 


Ala 


Arg 


Arg 


Ala Thr Gly 


Asn Leu Ser Ala 


Ser Cys Gly Ser Ala 


65 








70 




75 


80 


Leu 


Arg 


Ala 


Ala 


Ala Gly Leu 


Gly Gly Gly Asp 


Ser Gly Asp Gly Thr 










85 




90 


95 


Ala 


Arg 


Ala 


Ala 


Ser Lys Cys 


Gin Met Met Glu 


Glu Arg Ala Asn Leu 
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100 










105 




110 






Met 


His 


Met 


Met 


Lvs 


Leu 


Ser 


He 


Lys 


Val 


Leu Leu Gin Ser 


Ala 


Leu 






115 










120 






125 






Ser 


Leu 


Gly 


Arg 


Ser 


Leu 


Asp 


Ala 


Asp 


His 


Ala Pro Leu Gin 


Gin 


Phe 




130 










135 








140 






Phe 


Val 


Val 


Met 


Glu 


His 


Cys 


Leu 


Lvs 


His 


Gly Leu Lys Val 


Lvs 


Lys 


145 










150 










155 




160 




Phe 


lie 


Gly 


Gin 




Lvs 


Ser 


Phe 


Phe 


Gly Pro Leu Glu 


Leu 


Val 










165 










170 




175 




Glu 


Lvs 


Leu 


Cys 


Pro 


Glu 


Ala 


Ser 


Asp 


He 


Ala Thr Ser Val 


Arg 


Asn 








180 










185 




190 






Leu 


Pro 


Glu 




Lvs 


Thr 


Ala 


Val 


Gly 


Arq 


Gly Arg Ala Trp 


Leu 


Tvr 






195 










200 






205 






Leu 


Ala 




Met 


Gin 


Lvs 


Lys 




Ala 


Asp 


Tyr Leu Lys Val 


Leu 


He 




210 










215 








220 






Asp 




Lvs 


His 


Leu 


Leu 




Glu 


Phe 


Tvr 


Glu Pro Glu Ala 


Leu 


Met 


225 










230 










235 




240 


Met 


Glu 


Glu 


Glu 


Gly 


Met 


Val 


He 


Val 


Gly 


Leu Leu Val Gly 


Leu 












245 










250 




255 




Val 


Leu 


Asp 


Ala 




Leu 


C 


Leu 


Lys 


Gly 


Glu Asp Leu Asp 




Gin 








260 










265 




270 






Val 


Gly 


Val 


He 


Asp 


Phe 


Ser 


Leu 


Tyr 


Leu 


Lys Asp Val Gin 


Asp 


Leu 






275 










280 






285 






Asp 


Gly 


Gly 


Lvs 


Glu 


His 


Glu 


Arg 


lie 


Thr 


Asp Val Leu Asp 


Gin 


Lvs 




290 










295 








300 






Asn 


Tyr 


Val 


Glu 


Glu 


Leu 




Arq 


His 


Leu 


Ser Cys Thr Val 


Gly 


Asp 


305 










310 










315 




320 




Gin 


Thr 


Lvs 


He 


Asp 


Gly 


Leu 


Glu 


Lys 


Thr Asn Ser Lys 


Leu 


Gin 










325 










330 




335 




Glu 


Glu 


Leu 


Ser 


Ala 


Ala 


Thr 


Asp 


Arg 


He 


Cys Ser Leu Gin 


Glu 


Glu 








340 










345 




350 






Gin 


Gin 


Gin 




Arg 


Glu 


Gin 


Asn 


Glu 


Leu 


He Arg Glu Arg 


Ser 


Glu 






355 










360 






365 






Lvs 


Ser 


Val 


Glu 


He 


Thr 


Lys 


Gin 


Asp 


Thr 


Lys Val Glu Leu 


Glu 


Thr 




370 










375 








380 






Tyr 


Lys 


Gin 


Thr 


Arg 


Gin 


Gly 


Leu 


Asp 


Glu 


Met Tyr Ser Asp 


Val 


Trp 


385 










390 










395 




400 


Lys 


Gin 


Leu 


Lvs 


Glu 


Glu 


Lvs 


Lvs 


Val 


Arg 


Leu Glu Leu Glu 


Lys 


Glu 










405 










410 




415 




Leu 


Glu 


Leu 


Gin 


He 


Gly 


Met 


Lvs 


Thr 


Glu 


Met Glu He Ala 


Met 


Lvs 








420 










425 




430 






Leu 


Leu 


Glu 


Lys 


Asp 


Thr 


His 


Glu 


Lys 


Gin 


Asp Thr Leu Val 


Ala 


Leu 






435 










440 






445 






Arq 


Gin 


Gin 


Leu 


Glu 


Glu 


Val 


Lys 


Ala 


He 


Asn Leu Gin Met 


Phe 


His 




450 










455 








460 






Lys 


Ala 


Gin 


Asn 


Ala 


Glu 




Ser 


Leu 


Gin 


Gin Lys Asn Glu 


Ala 


He 


465 










470 










475 




480 


Thr 


Ser 


Phe 


Glu 


Gly 


Lys 


Thr 


Asn 


Gin 


Val 


Met Ser Ser Met 


Lys 


Gin 










485 










490 




495 




Met 


Glu 


Glu 


Arg 




Gin 


His 


Ser 


Glu 


Arg 


Ala Arg Gin Gly 




Glu 








500 










505 




510 






Glu 


Arg 


Ser 


His 


Lys 


Leu 


Gin 


Gin 


Glu 


Leu 


Gly Gly Arg He 


Gly 


Ala 






515 










520 






525 






Leu 


Gin 


Leu 


Gin 


Leu 


Ser 


Gin 


Leu 


His 


Glu 


Gin Cys Ser Ser 


Leu 


Glu 




530 










535 








540 






Lys 


Glu 


Leu 


Lys 


Ser 


Glu 


Lys 


Glu 


Gin 


Arg 


Gin Ala Leu Gin 


Arg 


Glu 


545 










550 










555 




560 


Leu 


Gin 


His 


Glu 


Lys 


Asp 


Thr 


Ser 


Ser 


Leu 


Leu Arg Met Glu 


Leu 


Gin 
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565 570 575 

Gin Val Glu Gly Leu Lys Lys Glu Leu Arg Glu Leu Gin Asp Glu Lys 
580 585 590 



Ala Glu 


Leu 


Gin Lys He Cys 


Glu Glu Gin Glu Gin Ala Leu 


Gl 


Gl 




595 








600 605 






Met Gly 




His Leu 


Ser Gin 




He 


ys 


610 








615 


620 P 




Glu Val 


Asn 


Gin 


Ala 


Leu Lys 


Gly His Ala Trp Leu Lys Asp 


Asp 


Glu 


625 








630 




640 


Ala Thr 


His 


Cys 


Arg 


Gin Cys 


Gl L Gl Ph 5 




Arg 








645 




U ^ 650 e er e er 


655 




Lys His 


His 


Cys 


Arg 


Asn Cys 


Gly His He Phe Cys Asn Thr 


Cys 


Ser 






660 






665 670 






Ser Asn 


Glu 


Leu Ala Leu Pro 




Val 


Cys 




675 








680 685 






Asp Ser 


Cys 


Hxs 


Thr 


Leu Leu 




Ala 


S r 
er 


690 








695 


700 






<210> 


192 












<211> 


331 












<212> 


PRT 












<213> 


Home 


d sapien 








<400> 


192 












Arg Ala 


Gly 


Ala 


Ser 


Ala Met 


Ala Leu Arg Lys Glu Leu Leu 




er 








5 




10 


15 S 




He Trp 


Tyr 


Ala 


Phe 


Thr Ala 


Leu Asp Val Glu Lys Ser Gly 


ys 


V 1 
a 






20 












Ser Lys 


Ser 


Gin 


Leu 


Lys Val 


Leu Ser His Asn Le T Th 


V 

a 


Leu 




35 








40 U ^ 18 Sn SU 45 X r 






His He 


Pro 


His 


Asp 


Pro Val 


Ala Leu Glu Glu His Phe Arg 


sp 


sp 


50 








55 




Asp Asp 


Gly 


Pro 


Val 


Ser Ser 


Gin Gl T r Met Pr T L 


sn 


yS 


65 








70 








Tyr He 


Leu 


Asp 


Lys 


Val Glu 


Glu Gly Ala Phe Val Lys Glu 




6 








85 










Asp Glu 


Leu 


Cys 


Trp 


Thr Leu 


Thr Ala L° L As T 


Al 


sp 






100 






r 105 ^ yS Sn ^ 110 






Ser Asn 


Gly 


Asn 


Ser 


Met Leu 


Ser Asn Gin Asp Ala Phe Arg 


Leu 


rp 




115 












Cys Leu 


Phe 


Asn 


Phe 


Leu Ser 


Glu Asp Lys Tyr Pro Leu He 


Met 


Val 
a 


130 








135 


140 






Pro Asp 


Glu 


Val 


Glu 


Tyr Leu 


Leu Lys Lys Val Leu Ser Ser 


Met 


S r 


145 








150 








Leu Glu 


Val 


Ser 


Leu 


Gly Glu 


Leu Glu Glu Leu Leu Ala Gin 


Glu 


Al 

a 








165 




170 






Gin Val 


Ala 


Gin 


Thr 


Thr Gly 


Gly Leu Ser Val Trp Gin Phe 


Leu 


Glu 
U 






180 






185 190 






Leu Phe 


Asn 


Ser 


Gly 


Arg Cys 


Leu Arg Gly Val Gly Arg Asp 


Thr 


Leu 




195 








200 205 






Ser Met 


Ala 


He 


His 


Glu Val 


Tyr Gin Glu Leu He Gin Asp 


Val 


Leu 


210 








215 


220 






Lys Gin 


Gly 


Tyr 


Leu 


Trp Lys 


Arg Gly His Leu Arg Arg Asn 


Trp 


Ala 


225 








230 


235 




240 


Glu Arg 


Trp 


Phe 


Gin 


Leu Gin 


Pro Ser Cys Leu Cys Tyr Phe 


Gly 


Ser 








245 




250 


255 




Glu Glu 


Cys 


Lys 


Glu 


Lys Arg 


Gly He He Pro Leu Asp Ala 


His 


Cys 






260 






265 270 
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Val Glu 


Val 


Leu 


Pro 


Asp 


Arg 


Asp 


Gly Lys Arg 


Cys Met Phe Cys 




275 










280 






285 


Val 


Lys Thr 


Ala 


Thr 


Arg 


Thr 


Tyr 


Glu 


Met Ser Ala 


Ser Asp Thr Arg 




290 








295 






300 




Gin 


Arg Gin 


Glu 


Trp 


Thr 


Ala 


Ala 


He 


Gin Met Ala 


He Arg Leu Gin 


305 








310 








315 


320 


Ala 


Glu Gly 


Lys 


Thr 


Ser 


Leu 
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Lys 


Asp Leu 
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330 





<210> 193 
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<212> PRT 
<213> Homo sapien 

<400> 193 









Pro 






Ser 


Val 


Ser 


Ser Gin Thr 


Xle Thr Lys Glu 


1 








5 










10 






Asn 


Arg 


Asn 


Val 


His 


Leu 


Glu 


His 


Ser Glu Gin 


Asn Pro Gl Se 








20 














30 


Ser 
er 


Ala 


Gl 


sp 


Thr 


er 


Al 


Ala 


is 


Gin Val Val 


Leu GJLy GXu Asn 






35 y 


















Leu 


He 


Ala 


Thr 


Ala 


Leu 
U 




Leu 


Ser 


Gl Ser Gl 


g 5 r s r A 
er n er sp 




50 










55 S 






y Sr 60 y 




Leu 


Lys 


Asd 


Val 


Ala 


Ser 


Thr 


Ala 


Gl 


Glu Glu Gl 


As Thr Ser Leu 


65 










70 










P r ^ so" 


Arg 


Glu 


Ser 


Leu 


His 


Pro 


Val 


Thr 


A 


Ser Leu Lys 


Ala Gly Cys His 










85 










90 




Thr 


Lvs 


Gin 


Leu 


Ala 


Ser 


Arc? 


Asn 


C 


Ser Glu Glu 


Lvs S Pr Gl 








100 










105 






Thr 
r 


Ser 


He 


Leu 
eu 


ys 


Glu 
U 


Gly 


Asn 




A Th S 
sp r er 


L ft Ph A 
















120 






125 ^ 6 r9 


Pro 


Val 


Val 


Ser 
er 


Pro 
ro 


Ala 


Asn 


Gl 


V 1 


Glu Gly Val 


Arg Val Asp Gin 














135 










Asp 


Asd 


Asd 


Gin 


As 

P 


Ser 


Ser 
er 


Ser 
er 


eu 




Gin Asn He Ala 


145 










150 








yS 155 Sr 




Val 


Gin 


Thr 


Asp 


Phe 


Lys 


Thr 


Ala 


Asp 


Ser Glu Val 


Asn Thr Asp Gin 










165 










170 


175 


Asp 


He 


Glu 


Lys 


Asn 


Leu 


Asp 


Lys 


Met 


Met Thr Glu 


Arg Thr Leu Leu 








180 










185 




190 


Lys 


Glu 


Arg 


Tyr 


Gin 


Glu 


Val 


Leu 


Asp 


Lys Gin Arg 


Gin Val Glu Asn 






195 










200 






205 


Gin 


Leu 


Gin 


Val 


Gin 


Leu 


Lys 


Gin 


Leu 


Gin Gin Arg 


Arg Glu Glu Glu 




210 










215 






220 




Met 


Lys 


Asn 


His 


Gin 


Glu 


He 


Leu 


Lys 


Ala He Gin 


Asp Val Thr He 


225 










230 








235 


240 


Lys 


Arg 


Glu 


Glu 


Thr 


Lys 


Lys 


Lys 


lie 


Glu Lys Glu 


Lys Lys Glu Phe 










245 










250 


255 


Leu 


Gin 


Lys 


Glu 


Gin 


Asp 


Leu 


Lys 


Ala 


Glu He Glu 


Lys Leu Cys Glu 








260 










265 




270 


Lys 


Gly 


Arg 


Arg 


Glu 


Val 


Trp 


Glu 


Met 


Glu Leu Asp 


Arg Leu Lys Asn 






275 










280 






285 


Gin 


Asp 


Gly 


Glu 


He 


Asn 


Arg 


Asn 


He 


Met Glu Glu 


Thr Glu Arg Ala 




290 










295 






300 




Trp 


Lys 


Ala 


Glu 


He 


Leu 


Ser 


Leu 


Glu 


Ser Arg Lys 


Glu Leu Leu Val 


305 










310 








315 


320 


Leu 


Lys 


Leu 


Glu 


Glu 


Ala 


Glu 


Lys 


Glu 


Ala Glu Leu 


His Leu Thr Tyr 










325 










330 


335 


Leu 


Lys 


Ser 


Thr 


Pro 


Pro 


Thr 


Leu 


Glu 


Thr Val Arg 


Ser Lys Gin Glu 
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340 345 350 



Trp 


Glu 


Thr 


Arg 


Leu 


Asn 


Gly 


Val 


Arq 


He Met Lys Lys 


Asn Va.1 Arcf 






355 










360 




365 




Asp 


Gin 


Phe 


Asn 


Ser 


His 


He 


Gin 


Leu 


Val Arg Asn Gly 


Ala Lys Leu 




370 










375 






380 




Ser 


Ser 


Leu 


Pro 


Gin 


lie 


Pro 


Thr 


Pro 


Thr Leu Pro Pro 


Pro Pro Ser 


385 










390 








395 




Glu 


Thr 


Asp 


Phe 


Met 




Gin 


Val 


Phe 


Gin Pro Ser Pro 


Ser Leu Ala. 










405 










410 


415 


Pro 


Arg 


Met 


Pro 


Phe 


Ser 


He 


Gly 


Gin 


Val Thr Met Pro 


Met Val Met 








420 










425 




430 


Pro 


Ser 


Ala 


Asp 


Pro 


Arq 


Ser 


Leu 


Ser 


Phe Pro He Leu 


Asn Pro Ala 






435 










440 




445 




Leu 


Ser 


Gin 


Pro 


Ser 


Gin 


Pro 


Ser 


Ser 


Pro Leu Pro Gly 


Ser His Gly 




450 










455 






460 


Arg 


Asn 


Ser 


Pro 


Gly 


Leu 


Gly 


Ser 


Leu 


Val Ser 




465 










470 








475 






<210> 


194 


















<211> 


241 


















<212> 


PRT 


















<213> 


Homo sapien 














<400> 


194 
















Met 






Glu 


Ser 


Ala 


Arq 


Ser 


Leu 


Gly Lys Gly Ser 


Ala Pro Pro 


1 








5 










10 




Gly 


Pro 


Val 


Pro 


Glu 


Gly 


Ser 


He 


Arg 


He Tyr Ser Met 


Arg Phe Cys 








20 










25 




30 


Pro 


Phe 


Ala 


Glu 


Arg 


Thr 


Arq 


Leu 


Val 


Leu Lys Ala Lys 


Gly He Arg 






35 










40 




45 




His 


Glu 


Val 


He 


Asn 


He 


Asn 




L 


Asn Lys Pro Glu 


Trp Phe Phe 




50 










55 






60 


Lys 


Lys 


Asn 


Pro 


Phe 


Gly 




Val 


Pro 


Val Leu Glu Asn 


Ser Gin Gly 


65 










70 








75 


80 


Gin 


Leu 


He 


Tyr 


Glu 


Ser 


Ala 


He 


Thr 


Cys Glu Tyr Leu 


Asp Glu Ala 










85 










90 


95 


Tyr 


Pro 


Gly 


Lys 


Lys 


Leu 


Leu 


Pro 


Asp 


Asp Pro Tyr Glu 


Lys Ala Cys 








100 










105 




110 


Gin 


Lys 


Met 


He 


Leu 


Glu 


Leu 


Phe 




Lys Val Pro Ser 


Leu Val Gly 






115 










120 




125 


Ser 


Phe 


He 


Arg 


Ser 


Gin 


Asn 


Lys 


Glu 


Asp Tyr Ala Gly 


Leu Lys Glu 




130 










135 






140 




Glu 


Phe 


Arg 


Lys 


Glu 


Phe 


Thr 


L 


Leu 


Glu Glu Val Leu 


Thr Asn Lys 


145 










150 








155 




Lys 


Thr 


Thr 


Phe 


Phe 


Gly 


Gly 


Asn 


Ser 


He Ser Met He 


Asp Tyr Leu 










165 










170 




He 


Trp 


Pro 


Trp 


Phe 


Glu 


Arg 


Leu 


Glu 


Ala Met Lys Leu 


Asn Glu Cys 








180 










185 






Val 


Asp 


His 


Thr 


Pro 


Lys 


Leu 


Lys 


Leu 


Trp Met Ala Ala 


Met Lys Glu 






195 










200 




205 


Asp 


Pro 


Thr 


Val 


Ser 


Ala 


Leu 


Leu 


Thr 


Ser Glu Lys Asp 


Trp Gin Gly 




210 










215 






220 




Phe 


Leu 


Glu 


Leu 


Tyr 


Leu 


Gin 


Asn 


Ser 


Pro Glu Ala Cys 


Asp Tyr Gly 


225 










230 








235 


240 


Leu 
























<210> 


195 
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<211> 138 
<212> PRT 
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Gl 




Lys 
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Gl 


Gl As L u Gl Gl 11 
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U P SU io" n 6 yS SU 
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Arg 


Glu 


Arg 


Gl 


rg 
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Leu Thr Thr Gin Ar Gin Leu Met 
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25 9 SU 30 




rg 


Ala 


Gl 


CI 

U 


Gl 

y 


ys 


Gl Pr L Al Gl A Gl 


er eu 






35 








y 40° er ^ a n 4 ^ g y 




Glu 


His 


Met 


Lys 


Leu 


He 


Leu Arg Asp Lys Glu Lys Glu Val 


Glu Cys 




50 










55 60 




Gin 


Gin 


Glu 


His 


He 


His 


Glu Leu Gin Glu Leu Lys Asp Gin 


Leu Glu 
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70 


75 


80 




Gin 


Leu 


Gin 


Gly 




His Aircj Lys Val Gly Glu Thx Sex 
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90 


95 


Leu 


Ser 


Gin 


Arg 


Glu 


Gin 


Glu He Val Val Leu Gin Gin Gin 


Leu Gin 
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Glu 


Ala 


Arg 


Glu 


Gin 
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Glu Leu Lys Glu Gin Ser Leu Gin 


Ser Gin 
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Leu 
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Glu 


Ala 


Gin 


Arg 


Ala Leu Ala Gin 
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Lys 
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Gl Gl Th A Gl Gl 


11 Th 
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Met 


Leu 


Thr 


Glu 
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100 










110 


Pro 


Ser 


He 


Ser 


Ser 


Asn 


Asn 


Ser Asn Pro Val Glu Asp 








115 










120 125 




Val 


Ala 


Phe 


Thr 


Cys 


Glu 


Pro 


Glu Val Gin Asn Thr Thr 


Tyr Leu Trp 




130 










135 






Trp 


Val 


Asn 


Gly 


Gin 


Ser 


Leu 


Pro Val Ser Pro Lys 




145 










150 




155 






<210> 


211 














<211> 


92 














<212> 


PRT 














<213> 


Homo sapien 










<400> 


211 














Glu 


Ser 


Pro 


Ser 


Ala 


Pro 


Pro His Arg Trp cys lie 


Pro Trp Gin 










5 






10 


15 


Arg 


Leu 


Leu 


Leu 


Thr Ala 


Ser 


Leu Leu Thr Phe Trp Asn 


Pro Pro Thr 








20 








25 


30 


Thr 


Ala 


Lys 


Leu 


Thr 


He 


Glu 


Ssr Thr Piro Ph© Asn Va.1 


Ala Glu Gly 






35 










40 45 




ys 


Glu 


Val 


Leu 


Leu 


Leu 


Val 


His Asn Lsu Piro Gin His 


Leu Phe Gly 




50 










55 


60 




Tyr 


Ser 


Trp 


Tyr 


Lys Gly Glu 


Arg Val Asp Gly Asn Arg 


Gin He He 












70 






80 


y 


Tyr 


Val 


He 


Gly Thr Gin 


pin al a Thr- Prn (Z~\ \7 

ij.Ln Ala iiir Jrro vjxy 












85 






90 






<210> 


212 














<211> 


142 














<212> 


PRT 














<213> 


Homo sapien 










<400> 


212 












Glu 


Lys Gin 


Lys Asn Lys 


Glu 


Phe Ser Gin Thr Leu Glu 


Asn Glu Lys 


1 








5 






10 


15 


Asn 


Thr 


Leu 


Leu 


Ser 


Gin 


He 


Ser Thr Lys Asp Gly Glu 


Leu Lys Met 








20 








25 


30 
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Leu 


Gin 


Glu 


Glu 


Val 


Thr 


Lys 


Met 


Asn 


Leu 


Leu Asn Gin Gin He 


Gin 






35 










40 






45 




Glu 


Glu 


Leu 


Ser 


Arg 


Val 


Thr 


Lys 


Leu 


Lys 


Glu Thr Ala Glu Glu 


Glu 




50 










55 








60 




Lys 




Asp 


Leu 


Glu 


Glu 


Arg 


Leu 


Met 


Asn 


Gin Leu Ala Glu Leu 


Asn 


65 










70 










75 


80 


Gly 


Ser 


He 


Gly 


Asn 


Tyr 


Cys 


Gin 


Asp 


Val 


Thr Asp Ala Gin He 


Lys 










85 










90 


95 




Asn 


Glu 


Leu 


Leu 


Glu 


Ser 


Glu 


Met 


Lys 


Asn 


Leu Lys Lys Cys Val 


Ser 








100 










105 




110 




Glu 


Leu 


Glu 


Glu 


Glu 


Lys 


Gin 


Gin 


Leu 


Val 


Lys Glu Lys Thr Lys Val 






115 










120 






125 




Glu 


Ser 


Glu 


He 


Arg 


Lys 


Glu 


Tyr 


Leu 


Glu 


Lys He Gin Gly 






130 










135 








140 





<210> 213 
<211> 142 
<212> PRT 
<213> Homo sapien 



<400> 213 



Gly 


Gly 


Tyr 


Gly 


Gly 


Gly 


Tyr 


Gly 


Gly 


Val 


Leu Thr Ala Ser Asp Gly 


1 








5 










10 


15 


Leu 


Leu 


Ala 


Gly 


Asn 


Glu 


Lys 


Leu 


Thr 


Met 


Gin Asn Leu Asn Asp Arg 








20 










25 




30 


Leu 


Ala 


Ser 


Tyr 


Leu 


Asp 


Lys 


Val 


Arg 


Ala 


Leu Glu Ala Ala Asn Gly 






35 










40 






45 


Glu 


Leu 


Glu 


Val 


Lys 


He 


Arg 


Asp 


Trp 


Tyr 


Gin Lys Gin Gly Pro Gly 




50 










55 








60 


Pro 


Ser 


Arg 


Asp 


Tyr 


Ser 


His 


Tyr 


Tyr 


Thr 


Thr He Gin Asp Leu Arg 


65 










70 










75 80 


Asp 


Lys 


He 


Leu 


Gly 


Ala 


Thr 


He 


Glu 


Asn 


Ser Arg He Val Leu Gin 










85 










90 


95 


He 


Asp 


Asn 


Ala 


Arg 


Leu 


Ala 


Ala 


Asp 


Asp 


Phe Arg Thr Lys Phe Glu 








100 










105 




110 


Thr 


Glu 


Gin 


Ala 


Leu 


Arg 


Met 


Ser 


Val 


Glu 


Ala Asp He Asn Gly Leu 






115 










120 






125 


Arg 


Arg 


Val 


Leu 


Asp 


Glu 


Leu 


Thr 


Leu 


Ala 


Arg Thr Asp Leu 




130 










135 








140 



<210> 214 
<211> 129 
<212> PRT 
<213> Homo sapien 

<400> 214 



Val 


Met 


Arg 


Val 


Asp 


Phe 


Asn 


Val 


Pro 


Met 


Lys Asn Asn Gin He Thr 


1 








5 










10 


15 


Asn 


Asn 


Gin 


Arg 


He 


Lys 


Ala 


Ala 


Val 


Pro 


Ser He Lys Phe Cys Leu 








20 










25 




30 


Asp 


Asn 


Gly 


Ala 


Lys 


Ser 


Val 


Val 


Leu 


Met 


Ser His Leu Gly Arg Pro 






35 










40 






45 


Asp 


Gly 


Val 


Pro 


Met 


Pro 


Asp 


Lys 


Tyr 


Ser 


Leu Glu Pro Val Ala Val 




50 










55 








60 


Glu 


Leu 


Arg 


Ser 


Leu 


Leu 


Gly 


Lys 


Asp 


Val 


Leu Phe Leu Lys Asp Cys 


65 










70 










75 80 


Val 


Gly 


Pro 


Glu 


Val 


Glu 


Lys 


Ala 


Cys 


Ala 


Asn Pro Ala Ala Gly Ser 










85 










90 


95 
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Val lie Leu Leu Glu Asn Leu Arg Phe His Val Glu Glu Glu Gly Lys 

100 105 110 

Gly Lys Asp Ala Ser Gly Asn Lys Val Lys Ala Glu Pro Ala Lys lie 
115 120 125 

Glu 



<210> 215 
<211> 148 
<212> PRT 
<213> Homo sapien 

<400> 215 



Met 


Ala 


Thr 


Leu 


Lys 


Glu 


Lys 


Leu 


He 


Ala Pro Val Ala 


Glu Glu Glu 


1 








5 










10 


15 


Ala 


Thr 


Val 


Pro 


Asn 


Asn 


Lys 


He 


Thr 


Val Val Gly Val 


Gly Gin Val 








20 










25 




30 


Gly 


Met 


Ala 


Cys 


Ala 


He 


Ser 


He 


Leu 


Gly Lys Ser Leu 


Ala Asp Glu 






35 










40 




45 




Leu 


Ala 


Leu 


Val 


Asp 


Val 


Leu 


Glu 


Asp 


Lys Leu Lys Gly 


Glu Met Met 




50 










55 






60 




Asp 


Leu 


Gin 


His 


Gly 


Ser 


Leu 


Phe 


Leu 


Gin Thr Pro Lys 


He Val Ala 


65 










70 








75 


80 


Asp 


Lys 


Asp 


Tyr 


Ser 


Val 


Thr 


Ala 


Asn 


Ser Lys He Val 


Val Val Thr 










85 










90 


95 


Ala 


Gly 


Val 


Arg 


Gin 


Gin 


Glu 


Gly 


Glu 


Ser Arg Leu Asn 


Leu Val Gin 








100 










105 




110 


Arg 


Asn 


Val 


Asn 


Val 


Phe 


Lys 


Phe 


He 


He Pro Gin He 


Val Lys Tyr 






115 










120 




125 




Ser 


Pro 


Asp 


Cys 


lie 


He 


He 


Val 


Val 


Ser Asn Pro Val 


Asp He Leu 



130 135 140 

Thr Tyr Val Thr 
145 



<210> 216 
<211> 527 
<212> PRT 
<213> Homo sapien 



<400> 216 



Gin 


Arg 


Ala 


Pro 


Gly 


He 


Glu 


Glu 


Lys 


Ala Ala Glu Asn 


Gly Ala Leu 


1 








5 










10 


15 


Gly 


Ser 


Pro 


Glu 


Arg 


Glu 


Glu 


Lys 


Val 


Leu Glu Asn Gly 


Glu Leu Thr 








20 










25 




30 


Pro 


Pro 


Arg 


Arg 


Glu 


Glu 


Lys 


Ala 


Leu 


Glu Asn Gly Glu 


Leu Arg Ser 






35 










40 




45 




Pro 


Glu 


Ala 


Gly 


Glu 


Lys 


Val 


Leu 


Val 


Asn Gly Gly Leu 


Thr Pro Pro 




50 










55 






60 




Lys 


Ser 


Glu 


Asp 


Lys 


Val 


Ser 


Glu 


Asn 


Gly Gly Leu Arg 


Phe Pro Arg 


65 










70 








75 


80 


Asn 


Thr 


Glu 


Arg 


Pro 


Pro 


Glu 


Thr 


Gly 


Pro Trp Arg Ala 


Pro Gly Pro 










85 










90 


95 


Trp 


Glu 


Lys 


Thr 


Pro 


Glu 


Ser 


Trp 


Gly 


Pro Ala Pro Thr 


He Gly Glu 








100 










105 




110 


Pro 


Ala 


Pro 


Glu 


Thr 


Ser 


Leu 


Glu 


Arg 


Ala Pro Ala Pro 


Ser Ala Val 






115 










120 




125 




Val 


Ser 


Ser 


Arg 


Asn 


Gly 


Gly 


Glu 


Thr 


Ala Pro Gly Pro 


Leu Gly Pro 




130 










135 






140 
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Ala 


Pro Lys 


Asn 


Gly 


Thr 


Leu 


Glu Pro Gly 


Thr Glu Ara Ar Ala Pro 


145 








150 








155 5 160 


Glu 


Thr Gly 


Gly 


Ala 




Ara 


Ala Pro Gly 


Ala Gly Arg Leu Asp Leu 








165 








170 


175 


Gly 


Ser Gly 


Gly 


Arg 


Ala 


Pro 


Val Gly Thr 


Gly Thr Ala Pro Gly Gly 






180 








185 




190 


Gly 


Pro Gly 




Glv 


Val 


Asd 


Ala Lys 


Ala 






195 










200 




205 


A 




Pro 


Pro 


Pro 


Pro 


Pro Leu 


Pro 






210 








215 






220 


Pro 


Arg Arg 


Leu 


Glu 


Pro 




Pro Pro Arg 


Ala Arg Pro Glu Val Ala 


225 








230 










Pro 


Glu Gly 


Glu 


Pro 


Gly 


Ala 


Pro Asp Ser 


Arg Ala Gly Gly Asp Thr 








245 








250 


255 


Ala 


Leu Ser 


Gly 


Asp 


Gly 


Asd 


Pro Pro 


Lys 








260 








265 




270 


Glu 


Met Pro 


Arg 


Leu 


Phe 


Leu 


Asp Leu Gly 






275 










280 




285 


Glu 


Gin lie 


_ 


Ala 


Ar 


Leu 


Ser Arg Leu 






290 








295 






300 


Leu 




Thr 


Pro 


Phe 




Gly Pro Gly 




305 








310 








315 320 


Glu 


Glv Ala 


As 

P 


Ala 


Gly 


Ala 


Ala Gly Gly 


Glu Ala Gly Gly Ala Gly 








325 








330 


335 


Ala 


Pro Gly 


Pro 


Ala 


Glu 


Glu 


Asp Gly Glu 








340 








345 




350 


Glu 


Glu Asp 




Glu 


Ala 


Ala 


Ala Pro Gly 






355 










360 




365 


Gly 


Pro Gly 


A 


Ala 


Ara 


Ala 


Ala Pro 


Val 


Pro Val Val Val Ser Ser 




370 








375 








Ala 


Asp Ala 


Asp 


Ala 


Ala 


A 

rg 


Pro Leu 


Arg 


Gly Leu Leu Lys Ser Pro 


385 








390 








395 400 


Arg 


Gly Ala 


Asp 


Glu 




Glu 


Asp Ser 


Glu 










405 








410 


415 


Met 


Val Ser 


Phe 


His 


Glv 


sp 


Val Thr 


Val 


Tyr Leu Phe Asp Gin Glu 






420 








425 






Thr 


Pro Thr 




Glu 


Leu 


Ser 


Val Gin 


Ala 


Pro Pro Glu Gly Asp Thr 




435 










440 




445 


Asd 




Thr 


Pro 


Pro 


Ala 


Pro Pro 


Thr 






450 








455 






460 


Pro 


Gly Asp 


Gly 


Phe 


Pro 


Ser 


Asn Asp 


Ser 


Gly Phe Gly Gly Ser Phe 


465 








470 








475 " 480 


Glu 


Trp Ala 


Glu 


Asp 


Phe 


Pro 


Leu Leu 


Pro 


Pro Pro Gly Pro Pro Leu 








485 








490 


495 


Cys 


Phe Ser 


Arg 


Phe 


Ser 


Val 


Ser Pro 


Ala 


Leu Glu Thr Pro Gly Pro 






500 








505 




510 


Pro 


Ala Arg 


Ala 


Pro 


Asp 


Ala 


Arg Pro Ala 


Gly Pro Val Glu Asn 




515 










520 




525 



<210> 217 
<211> 466 
<212> DNA 
<213> Homo sapien 

<400> 217 

gaatggtgcc tgtcctgctg tctctgctgc tgcttctggg tcctgctgtc ccccaggaga 60 

accaagatgg tcgttactct ctgacctata tctacactgg gctgtccaag catgttgaag 120 

acgtccccgc gtttcaggcc cttggctcac tcaatgacct ccagttcttt agatacaaca 180 
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gtaaagacag gaagtctcag cccatgggac tctggagaca ggtggaagga atggaggatt 240 

ggaagcagga cagccaactt cagaaggcca gggaggacat ctttatggag accctgaaag 300 

acatcgtgga gtattacaac gacagtaacg ggtctcacgt attgcaggga aggtttggtt 360 

gtgagatcga gaataacaga agcagcggag cattctggaa atattactat gatggaaagg 420 

actacattga attcaacaaa gaaatcccag cctgggtccc cttcga 466 

<210> 218 
<211> 381 
<212> DNA 
<213> Homo sapien 

<400> 218 

gagtttcctt cgcaagttca tgtggggtac cttcccaggc tgcctggctg accagctggt 60 

tttaaagcgc cggggtaacc agttggagat ctgtgccgtg gtcctgaggc agttgtctcc 120 

acacaagtac tacttcctcg tgggctacag tgaaactttg ctgtcctact tttacaaatg 180 

tcctgtgcga ctccacctcc aaactgtgcc ctcaaaggtt gtgtataagt acctctagaa 240 

caatcccctt ttttccatca agctgtagcc tgcagagaat ggaaacgtgg gaaaggaatg 300 

gtatgtgggg gaaatgcatc ccctcagagg actgaggcat agtctctcat ctgctattga 360 

ataaagacct tctatcttgt a 381 

<210> 219 

<211> 1293 

<212> DNA 

<213> Homo sapien 

<400> 219 

gaggggaggc gcatggcggg gatggcgctg gcgcgggcct ggaagcagat gtcctggttc 60 

tactaccagt acctgctggt cacggcgctc tacatgctgg agccctggga gcggacggtg 120 

ttcaattcca tgctggtttc cattgtgggg atggcactat acacaggata cgtcttcatg 180 

ccccagcaca tcatggcgat attgcactac tttgaaatcg tacaatgacc aagatgcgac 240 

caggatcaga ggttccttgg ggaagaccca ccctacgaag ttggaatgag accatcagat 300 

gtgataagaa actcttctag atgtcaacat aaccaacctt ataaagacta aaattcatga 360 

gtagaacagg aaaatcatcc tgactcatgt gttgtgttct ttatttttaa ttttcaaaga 420 

ggctcttgta tagcagtttt tgtctatttt aacattgtag tcatttgtac tttgatatca 480 

gtattttctt aacctttgtg actgtttcaa tattaccccc gtgaaagctt ttcttaatgt 540 

aactttgagt acattttaat tgccttctat ttttaaaact caaaatcatt agttgggctt 600 

tactgttctt gctattgtat ggcatataca tctgcctgga tatatttcta ctcttgacca 660 

aagttttgta aagaacaata taagatttcg ggtaggggta tggggaggga agatatttta 720 

ttgagaacta cttaacaaaa gatttatctg taagcttgaa ctcaggagta cagttttagc 780 

tatctagact ctaacagctt ttgctttaaa attattaaag tgtttcttaa tgaaaaagaa 840 

aagatcttgc taaagttaaa ataaggaaca tttcaccttt taaatattta attcttatgt 900 

ggacttattt ccagaaaact ttggtgataa ttcttgagac aaaaggtggt taagtagcat 960 

tattatgtaa tgcttatata ccatagagtt tttaatagaa gagaaatcca tttcctccga 1020 

gggtcactat taacaatgta cttccttaaa tttagtttaa tgattgtaat gggtgctgca 1080 

tttgcacatt gcattaagtt atgatgagac gaattgttgt taaaaattat agcaaaaaga 1140 

aatgtaaact tggttaaaat cctttcactc tttgtattgt tttttttaag gtttttattc 1200 

cttaaatgta aaatgactac ctaatttttt gatgtaaata cattaaattc aaagagaaaa 1260 

aaaatcaaaa aaaaaaaaaa aaaaaaactc gag 1293 

<210> 220 
<211> 983 
<212> DNA 
<213> Homo sapien 

<400> 220 

caggttattc tgatcctgcc gcctgtcttc cctgtaagag tggagcctcg aggtgtacct 60 

taaagtgacc ggaatgttag agatgcaatt tgcagagctg gggcaaggaa gggctccttg 120 

tcactgtagt tactttcctt gcagtggcca aatgcccaat aagaaggaat acatgaccac 180 
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tgctgtgggg agtcagcagg tgcgtgatgc agctggccac actccatcca cggccatgac 240 

ataaaacaga caagaagtaa ggctggactg taacacctca aggcctgctc cagtgaccca 300 

ctttcttcag agaggctcta ccacacacac aaccaccttc caaatttaca ctcagatcac 360 

tacaccatgt ctcccaagtt aaaacatgta tccacctaga ctttaaatgt gctttgtaac 420 

tgttgatggc actgtacaga gggccaaagt atttcccatc agatagcatt tttctgaacc 480 

catgcctctt gggacgagat cacaggactt gacccatcat caaataggac caggtgacct 540 

acagagacat cacaatgatg gcttcctaca gtcaagtcca tttccaataa tgctctcatc 600 

taagagaacc catgaacctt atttgaatcc tggttcaaac aaaaacctta aattatttat 660 

gagacaatta taaacttgat agattttgat gtgtgaaggt atttatgaat atttttagtc 720 

agtgatggta tactgttaag gaaaaggttc atattttagg gacaaaggct gaaacattta 780 

tggacagagt gatatgatat ctgggatttg ttttaggatg aagtgggagg gaggaaatga 840 

atggaaatag tgttgaaaca gtattggcca cgagtcagct attgtgtgct aagacgctcc 900 

tcacaccagt ctactctgta tgtgtttgaa tatctctgta ataaacttaa caaggaaaaa 960 

aaaaaaaaaa aaaaaaactc gag 983 

<210> 221 
<211> 373 
<212> DNA 
<213> Homo sapien 

<400> 221 
cattttatgg gttaattttt tattaaatag 
tattcaatga tacattcgga aaataaatgt 
tttcagtact tttaggtaag attaatccaa 
ataaggggaa aatacaataa tgttgagaaa 
ttgagaaatg gacataaatg atttagtatt 
gcttttgtgt agcgttagat gaattaaata 
tttccagacc ata 

<210> 222 
<211> 544 
<212> DNA 
<213> Homo sapien 

<400> 222 
cagagatgct gctgctacaa aggatcggtg 
gaaagaaaga agcaatgtgg ttataacaag 
ggaacataat gtatttcaaa acaaaataca 
gatgaagttt cagcaagttc gtgagcagat 
aaatggtata ctgagagatg cagtcagcaa 
tgcagaacta aataaactac gccaggatta 
aacaggaaag ctacagcaag aggaagtcca 
gttgaaggtt caactacaag aagctgagag 
gaagagaaca gcggaacatg aggcagcaca 
agaa 

<210> 223 
<211> 316 
<212> DNA 
<213> Homo sapien 

<400> 223 
gaggcaaggg atatgcttta gtgcctatta 
cagttgaatc tgctgaagga aaatctgaag 
aggaagcaga aatggaagaa agtggacgaa 
atatcttggt ttcttctaca ccaataaatg 
agcagtgtga acctgctgaa agtcagccag 
gtaactctaa cagttg 



caataagata cttttataac 
ataaaatatg aaaaagtact 
ctaaacacta gcatatgtta 
gcaaactcaa agcatagatc 
tttaaagaga gtgaaaaatc 
acatatgcac atatagcttt 



tcaataaaat 60 

aaaaagcatt 120 

tacagtaata 180 

aatgaaaaaa 240 

attattttat 300 

gcgatacaaa 360 
373 



taagcagtta acccaggaaa 
gatgaaagat cgaattggaa 
tgtcagttat caagagactc 
ggaggcagag atagctcact 
cactacaaat caactggaaa 
tgctaggttg gtgaatgagc 
aaagaagaat gctgagcaag 
aaggtgggaa gaagttcaga 
gctagattta cagagtaaat 



tgatgacaga 60 

cattagaaaa 120 

aacagatgca 180 

tgaagcagga 240 

gcaagcagtc 300 

tgactgagaa 360 

cagctactca 420 

gctacatcag 480 

ttgtggccaa 540 
544 



tagttaattc ttcaactcca 
aagtaaatga aacattagtt 
gtgcaactcc tgttaactgt 
aaggacagac tgtgttagac 
aagcacttct gagaggaaga 



aagtctaaaa 60 
atacccactg 120 
gaacagcctg 180 
aaggtggctg 240 
tgtttgcaag 300 
316 



WO 02/092001 



101 



PCT7US02/14975 



<210> 224 
<211> 1583 
<212> DNA 
<213> Homo sapien 



<400> 224 

cagaccacgt ctgccctcgc cgctctagcc ctgcgcccca gcccggccgc ggcacctccg 60 

cctcgccgcc gctaggtcgg ccggctccgc ccggctgccg cctaggatga atatcatgga 120 

cttcaacgtg aagaagctgg cggccgacgc aggcaccttc ctcagtcgcg ccgtgcagtt 180 

cacagaagaa aagcttggcc aggctgagaa gacagaattg gatgctcact tagagaacct 240 

ccttagcaaa gctgaatgta ccaaaatatg gacagaaaaa ataatgaaac aaactgaagt 300 

gttattgcag ccaaatccaa atgccaggat agaagaattt gtttatgaga aactggatag 360 

aaaagctcca agtcgtataa acaacccaga acttttggga caatatatga ttgatgcagg 420 

gactgagttt ggcccaggaa cagcttatgg taatgccctt attaaatgtg gagaaaccca 480 

aaaaagaatt ggaacagcag acagagaact gattcaaacg tcagccttaa attttcttac 540 

tcctttaaga aactttatag aaggagatta caaaacaatt gctaaagaaa ggaaactatt 600 

gcaaaataag agactggatt tggatgctgc aaaaacgaga ctaaaaaagg caaaagctgc 660 

agaaactaga aattcatctg aacaggaatt aagaataact caaagtgaat ttgatcgtca 720 

agcagagatt accagacttc tgctagaggg aatcagcagt acacatgccc atcaccttcg 780 

ctgtctgaat gactttgtag aagcccagat gacttactat gcacagtgtt accagtatat 840 

gttggacctc cagaaacaac tgggaagttt tccatccaat tatcttagta acaacaatca 900 

gacttctgtg acacctgtac catcagtttt accaaatgcg attggttctt ctgccatggc 960 

ttcaacaagt ggcctagtaa tcacctctcc ttccaacctc agtgacctta aggagtgtag 1020 

tggcagcaga aaggccaggg ttctctatga ttatgatgca gcaaacagta ctgaattatc 1080 

acttctggca gatgaggtga tcactgtgtt cagtgttgtt ggaatggatt cagactggct 1140 

aatgggggaa aggggaaacc agaagggcaa ggtgccaatt acctacttag aactgctcaa 1200 

ttaagtaggt ggactatgga aaggttgccc atcatgactt tgtatttata tacaattaac 1260 

tctaaataaa gcaggttaag tatcttccat gttaatgtgt taagagactg aaaataccag 1320 

ccatcagaaa ctggcctttt tgccaataaa gttgcatggt aaatatttca ttacagaatt 1380 

tatgttagag ctttcatgcc aagaatgttt tcttacaaaa ttctcttttt attgaggttt 1440 

cactaataag cagcttctac ttttgagcct caacttaaag cagaactgtt ttttactgga 1500 

tttttcatta acagcaagct ttttttttta tgtaaaataa atctattgtg aattgaaaaa 1560 

aaaaaaaaaa aaaaaaactc gag 1583 

<210> 225 
<211> 491 
<212> DNA 
<213> Homo sapien 



.<400> 225 

gaacaacatc atcttgaatc actagataga ctcttgacgg aaagcaaagg ggaaatgaaa 60 

aaggaaaata tgaagaaaga tgaagcttta aaagcattac agaaccaagt atctgaagaa 120 

acaatcaagg ttaggcaact agattcagca ttggaaattt gtaaggaaga acttgtcttg 180 

catttgaatc aattggaagg aaataaggaa aagtttgaaa aacagttaaa gaagaaatct 240 

gaagaggtat attgtttaca gaaagagcta aagataaaaa atcacagtct tcaagagact 300 

tctgagcaaa acgttattct acagcatact cttcagcaac agcagcaaat gttacaacaa 360 

gagacaatta gaaatggaga gctagaagat actcaaacta aacttgaaaa acaggtgtca 420 

aaactggaac aagaacttca aaaacaaagg gaaagttcag ctgaaaagtt gagaaaaatg 480 

gaggagaaat g 491 

<210> 226 

<211> 483 

<212> DNA 

<213> Homo sapien 



<400> 226 

cagccgcacg ccgcggagca ggggctcgga ggtcccggga ttacggtgct cgagcacgct 



60 
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ggtgggaaag gacccgggac ttgaacagtg ttgtgcggcg ccatgcaggt ctccagcctc 120 

aatgaggtga agatttacag cctcagctgc ggcaagtccc ttcctgagtg gctttctgat 180 

aggaagaaga gagcgctaca gaagaaagat gtagatgtcc gtaggagaat tgaacttatt 240 

caggactttg aaatgcctac tgtgtgtacc actattaagg tgtcaaaaga tggacagtac 300 

attttagcaa ctggaacata taaacctcgg gttcgatgtt atgacaccta tcaattatcc 360 

ttgaagtttg aaaggtgttt agattcagaa gttgtcacct ttgaaatttt gtctgatgac 420 

tactcaaaga ttgtcttctt acataatgat agatacattg aatttcattc gcaatcaggt 480 

ttt ' 483 

<210> 227 
<211> 486 
<212> DNA 
<213> Homo sapien 

<400> 227 

gagcctcgct aagctccgac tctgggcggc accgggcgtc ccacgatgcc gaagaacaag 60 

aagcggaaca ctccccaccg cggtagcagt gctggcggcg gcgggtcagg agcagccgca 120 

gcgacggcgg cgacagcagg tggccagcat cgaaatgttc agccttttag tgatgaagat 180 

gcatcaattg aaacagtgag ccattgcagt ggttatagcg atccttccag ttttgctgaa 240 

gatggaccag aagtccttga tgaggaagga actcaagaag acctagagta caagttgaag 300 

ggattaattg acctaaccct ggataagagt gcgaagacaa ggcaagcagc tcttgaaggt 360 

attaaaaatg cactggcttc aaaaatgctg tatgaattta ttctggaaag gagaatgact 420 

ttaactgata gcattgaacg ctgcctgaaa aaaggtaaga gtgatgagca acgtgcagct 480 

gcagcg 486 

<210> 228 
<2U> 494 
<212> DNA 
<213> Homo sapien 

<400> 228 

gaggccagga ctccgggaat gcgagcaggc cccttattct cccagtggcc tcggtctgtc 60 

cccacagcgg cccggtcagg gttgcccgag ccccaaggcg gggggcggca ccggggtgct 120 

gaaagggaca gaatgctttg acctccaagc tgttttaaat ctagtagata agccagatcc 180 

tgtgttgcca taagcccttg gcccacattt aagtgggaat gcagctagct tggatgtctg 240 

aaactttgta agcgccttct gtctgaatcc tgaacacagg caccaagact actgaagaag 300 

ctcgtcattc ttgtgcaggg atagccacac aagcaaacat gtttgcaaaa cttgaaagaa 360 

agaaaattgc agaaagaaga cttgctgttc ttaagaggcc caggaaggtg ctacttagga 420 

atcccaccgg cttgtgaagc aagggaatca agtttgcctt caatggggaa cttgacttca 480 

ggaaaatgaa cttt 494 

<210> 229 
<211> 465 
<212> DNA 
<213> Homo sapien 

<400> 229 

gtcagagagc tggtataacc tcctgttgga catgcagaac cgactcaata aggtcatcaa 60 

aagcgtgggc aagattgagc actccttctg gagatccttt cacactgagc gaaagacaga 120 

accagccaca ggcttcatcg atggtgatct gattgaaagt ttcctagata tcagccgccc 180 

taagatgcag gaggttgtgg caaacttgca gtatgatgat ggcagtggta tgaagcggga 240 

ggcaactgca gatgacctca tcaaagtcgt ggaggaacta actcggatcc attagccaag 300 

gacaggatct cttttcctga ccctcctaaa ggcgttgccc tcctatcctc ccttccttgc 360 

ccacccttgg tttctttggc atgggaaggt tttccttaac cacttgccct agagccacca 420 

gtgaccttgt gtggaaacag ggtttttttt acttaaaaca gttca 465 



<210> 230 
<211> 495 
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<212> DNA 

<213> Homo sapien 

<400> 230 

caggggaaag ggtgtttggc cttgaccagc cactgctgac ctcaatctca gacctacaga 60 

tggtgaatat ctccctgcga gtgttgtctc gacccaatgc tcaggagctt cctagcatgt 120 

accagcgcct agggctggac tacgaggaac gagtgttgcc gtccattgtc aacgaggtgc 180 

tcaagagtgt ggtggccaag ttcaatgcct cacagctgat cacccagcgg gcccaggtat 240 

ccctgttgat ccgccgggag ctgacagaaa gggccaaagg acttcagcct catcctggat 300 

gatgtggcca tcacagactt gagctttagc cgagaagtac acaagctgcc tgtaagaaac 360 

ccaaccaagt ggggtgaatt ccaaaaaccc gtgggggtga agggcttctt aagaatgcaa 420 

ggaaggagga aaagaattcc atgggggggg ggttccttaa cccaggaaca ggggtttccc 480 

ttgaattttt ttcca 495 

<210> 231 
<211> 498 
<212> DNA 
<213> Homo sapien 

<400> 231 

ggcagcttct gagaccaggg ttgctccgtc cgtgctccgc ctcgccatga cttcctacag 60 

ctatcgccag tcgtcggcca cgtcgtcctt cggaggcctg ggcggcggct ccgtgcgttt 120 

tgggccgggg gtcgcttttc gcgcgcccag cattcacggg ggctccggcg gccgcggcgt 180 

atccgtgtcc tccgcccgct ttgtgtcctc gtcctcctcg gggggctacg gcggcggcta 240 

cggcggcgtc ctgaccgcgt ccgacgggct gctggcgggc aacgagaagc taaccatgca 300 

gaacctcaac gaccgcctgc ctcctacctg gacaaagtgc gcgccctgga agcgggcaac 360 

ggcgaactta gaggtgaaag aatcccgcga actggtacca aaaacaaggg gcctggggcc 420 

ttccgcgact tacagccaac ttactacacc gaacattcaa gaacttgcgg gaacaaaaat 480 

ttttggtgcc acccattt 498 

<210> 232 
<211> 465 
<212> DNA 
<213> Homo sapien 

<400> 232 

caggccggcc gagtaggaaa gctggaggcg cgggtgggga acatgtctga gtcggagctc 60 

ggcaggaagt gggaccggtg tctggcggat gcggtcgtga agataggtac tggttttgga 120 

ttaggaattg ttttctcact taccttcttt aaaagaagaa tgtggccatt agccttcggt 180 

tctggcatgg gattaggaat ggcttattcc aactgtcagc atgatttcca ggctccatat 240 

cttctacatg gaaaatatgt caaagagcag gagcagtgac ttcacctgag aacatcccag 300 

cgggaggaca agagaaaatc atgtttattc ctcaggaata cttgaagtgc cctggagtaa 360 

actgccattc ttctgtaaca atggtatcag taatgcttta aactccagca cctggttatg 420 

catttgaaac ccaagtctgg ttcttggttt ggattttctc tctgg 465 

<210> 233 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 90, 97, 242, 244 
<223> n = A,T,C or G 

<400> 233 

cagtaaaaaa ggttatgttt tattaattgc tggacaaccg tgggaaaaca aataagcaat 60 
tgacaccacc aaattcttat tacattcaan ataaaanatt tattcacacc acaaaaagat 120 
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aatcacaaca aaatatacac 
tattctcttt ttaagtgggt 
ananaataac aaatattaaa 
gacataaccc tgaagctttt 
ttatca 



taacttaaaa aacaaaagat 
aaaagtattt tgtttgcttc 
atacagtgat agtttgcatt 
agtttacagg gagtttccat 



tatagtgaca taaaatgtta 180 
tacataaatt tctattcatg 240 
tcttctatag aatgaacata 300 
gaagccacaa actaaactaa 360 
366 



<210> 234 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<400> 234 

gagggcagcc ctcctacctg cgcacgtggt 

acccagtgcc tgcagccatg gctcccggcc 

ccggccttgt ggaatttgca agaaacctga 

gagggactgc aaaagctctc agggatgctg 

cgggatttct gaaatgttgg ggggacgtgt 

tggaatccta gctcctaata ttcagaagat 
tcttataaca attgttgcc 

<210> 235 
<211> 406 
<212> DNA 
<213> Homo sapiens 



gccgccgctg ctgcctcccg ctcgccctga 60 

agctcgcctt atttagtgtc tctgacaaaa 120 

ccgctcttgg tttgaatctg gtcgcttccg 180 

gtctggcagt cacagatgtc tctgagttga 240 

gaaaactttg catcctgcac gatcccatgc 300 

aatgcttgac atgcgccaca cttgattcaa 360 
379 



<400> 235 

caggctgcac catgtacccc accttcagtt 

actgggaggt gggacccctt tcattttcag 

gtttccagtt taattgtttt tgaccttcta 

cattactgtt gttaagcact gtgttccaga 

tttatagttt tattttaatg aaaccctgat 

tttacctgtc accgaagcca ggaagccccg 

tacatcctcc agtggcgttc tttttactct 

<210> 236 
<211> 278 
<212> DNA 
<213> Homo sapiens 



taaaagaaaa aaaaaatccc cttcactcct 60 
ttttgctcat ctagggaaaa taaggctttg 120 
aaatgttttt atgttagcac tgatagttgg 180 
ccgtgtctga cttagtgtaa cctaggagat 240 
tgacgcacag cagtggggag aacagcgtct 300 
tttgtaagcg tgtgttgtgg tgctttattg 360 
aatgttcttt tggttt 406 



<400> 236 

gagattagca cctgtgaaca atgcgttctc 
tcttaagcta attctgcaaa atcacatatt 
caatggacag atactggaaa ccattggagg 
ggctatctgc atagaaaact catgcatggt 
cattcacata ttccgagaga tcatccaacc 



tgatgacact ctgagcatgg accaacgcct 60 
gaaagtaaaa gttggcctta gcgacctcta 120 
caaacaactc cgagtctttg tgtatcggac 180 
gagaggaagc aagcagggaa ggaacggtgc 240 
agcagaat 278 



<210> 237 
<211> 322 
<212> DNA 
<213> Homo sapiens 



<400> 237 

cagggccgtg gcggaggagg agcgctgcac 

ggagttcgtg cagcagtacg tgcgcccctg 

cgtctgcccc gaccgcccaa ggccgccttc 

ggcggcgagg ctggcggtcc ggcctggccg 

gccctgtgct cccgcgacag gttgctggct 

ccgccaacac ctactcctac ct 



ggtggagcgt cgggccgacc tcacctacgc 60 
atcgcggagg tcgcgtcctg ttcaccggcc 120 
ccctgacctc gcgcgcacgc gtggggctgg 180 
cgactctgcc cttctttcca gaggttccgg 240 
tcgtttgggg acagagtggt ccggtgagca 300 
322 
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<210> 238 
<211> 613 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 399 

<223> n = A,T,C or G 



<400> 238 

gaattcggca ccagccttct tggatcagga ccagtctcca ccccgtttct acagtggaga 60 

tcagcctcct tcttatcttg gtgcaagtgt ggataaactc catcaccctt tagaatttgc 120 

agacaaatct cccacacctc ctaatttacc tagcgataaa atctaccctc cttctgggtc 180 

ccccgaagag aataccagca cagccaccat gacttacatg acaactactc cagcaacagc 240 
ccaaatgagc accaaggaag ccagctggga tgtggctgaa caacccacca ctgctgattt 300 

tgctgctgcc acacttcagc gcacgcacag aactaatcgt ccccttcccc ctccgccttc 360 

ccagagatct gcagagcagc caccagttgt ggggcaggna caagcagcaa ccaatatagg 420 

attaaataat tcccacaagg ttcaaggagt agttccagtt ccagagaggc cacctgaacc 480 

tcgagccatg gatgaccctg cgtctgcctt catcagtgac agtggtgctg ctgctgctca 540 

gtgtcccatg gctacagctg tccagccagg cctgcctgag aaagtgcggg acggtgcccg 600 
ggtcccgctg ctg 613 

<210> 239 
<211> 613 
<212> DNA 
<213> Homo sapiens 

<400> 239 

gaattcggca ccaggggaca ctggtgctga gctggatgat gatcagcact ggtctgacag 60 

cccgtcggat gctgacagag agctgcgttt gccgtgccca gctgaggggg aagcagagct 120 

ggagctgagg gtgtcggaag atgaggagaa gctgcccgcc tcaccgaagc accaagagag 180 

aggtccctcc caagccacca gccccatccg gtctccccag gaatcagctc ttctgttcat 240 

tccagtccac agcccctcaa cagaggggcc ccaactccca cctgtccctg ccgccaccca 300 

ggagaaatca cctgaggagc gccttttccc tgagcctttg ctccccaaag agaagcccaa 360 

agctgatgcc ccctcggatc tgaaagctgt gcactctccc atccgatcac agccagtgac 420 

cctgccagaa gctaggactc ctgtctcacc agggagcccg cagccccagc cacccgtggc 480 

ggcctccacg cccccaccca gcgaggtctc cagagccttc tctctcctgt gcaaaatggc 540 

aactcttaag gaaaaactca ttgcaccagt tgcggaagaa gaggcaacag ttccaaacaa 600 
taagatcact gta 613 

<210> 240 
<211> 585 
<212> DNA 
<213> Homo sapiens 



<400> 240 

gaattcggca cgaggtgaga tctacgatga 
tgccaacccc agttattaaa gcttttggca 
aggattatgg tcttgatcca aagattgcta 
ctgaaggtaa attaaatgat cattttcctc 
agacaaatat gaatgtaaat gaagtcatta 
aacttggcag caagatacct gtgcatccca 
atgatacttt tcccacagca atgcacattg 
taccaggact acagaagtta catgatgctc 
tcatcaagat tggacgtact catactcagg 
ttagtggtta tgttcaacaa gtaaaatatg 



actttaagat tggaggtgtg acagaacgca 60 
tcttgaagcg agcggccgct gaagtaaacc 120 
atgcaataat gaaggcagca gatgaggtag 180 
tcgtggtatg gcagactgga tcaggaactc 240 
gcaatagagc aattgaaatg ttaggaggtg 300 
acgatcatgt taataaaagc cagagctcaa 360 
ctgctgcaat agaagttcat gaagtactgt 420 
ttgatgcaaa atccaaagag tttgcacaga 480 
atgctgttcc acttactctt gggcaggaat 540 
caatgacaag aataa 585 
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<210> 241 
<211> 566 
<212> DNA 
<213> Homo sapiens 

<400> 241 

gaattcggca ccaggcgagc tgcacctcga ggtgaaggcc tcactgatga acgatgactt 60 
cgagaagatc aagaactggc agaaggaagc ctttcacaag cagatgatgg gcggcttcaa 120 
ggagaccaag gaagctgagg acggctttcg gaaggcacag aagccctggg ccaagaagct 180 
gaaagaggta gaagcagcaa agaaagccca ccatgcagcg tgcaaagagg agaagctggc 240 
tatctcacga gaagccaaca gcaaggcaga cccatccctc aaccctgaac agctcaagaa 300 
attgcaagac aaaatagaaa agtgcaagca agatgttctt aagaccaaag agaagtatga 360 
gaagtccctg aaggaactcg accagggcac accccagtac atggagaaca tggagcaggt 420 
gtttgagcag tgccagcagt tcgaggagaa acgccttcgc ttcttccggg aggttctgct 480 
ggaggttcag aagcacctag acctgtccaa tgtggctggc tacaaagcca tttaccatga 540 
cctggagcag agcatcagag cagctg 566 

<210> 242 
<211> 556 
<212> DNA 

<213> Homo sapiens 
<400> 242 

gaattcggca cgagcaaagg tgaagcagga catgcctccg cccgggggct atgggcccat 60 
cgactacaaa cggaacttgc cgcgtcgagg actgtcgggc tacagcatgc tggccatagg 120 
gattggaacc ctgatctacg ggcactggag cataatgaag tggaaccgtg agcgcaggcg 180 
cctacaaatc gaggacttcg aggctcgcat cgcgctgttg ccactgttac aggcagaaac 240 
cgaccggagg accttgcaga tgcttcggga gaacctggag gaggaggcca tcatcatgaa 300 
ggacgtgccc gactggaagg tgggggagtc tgtgttccac acaacccgct gggtgccccc 360 
cttgatcggg gagctgtacg ggctgcgcac cacagaggag gctctccatg ccagccacgg 420 
cttcatgtgg tacacgtagg ccctgtgccc tccggccacc tggatccctg cccctcccca 480 
ctgggacgga ataaatgctc tgcagacctg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 540 
aaaaaaaaaa ctcgag 556 

<210> 243 
<211> 591 
<212> DNA 

<213> Homo sapiens 
<400> 243 

gtctatgttt gcagaaatac agatccaaga caaagacagg atgggcactg ctggaaaagt 60 
tattaaatgc aaagcagctg tgctttggga gcagaagcaa cccttctcca ttgaggaaat 120 
agaagttgcc ccaccaaaga ctaaagaagt tcgcattaag attttggcca caggaatctg 180 
tcgcacagat gaccatgtga taaaaggaac aatggtgtcc aagtttccag tgattgtggg 240 
acatgaggca actgggattg tagagagcat tggagaagga gtgactacag tgaaaccagg 300 
tgacaaagtc atccctctct ttctgccaca atgtagagaa tgcaatgctt gtcgcaaccc 360 
agatggcaac ctttgcatta ggagcgatat tactggtcgt ggagtactgg ctgatggcac 420 
caccagattt acatgcaagg gcaaaccagt ccaccacttc atgaacacca gtacatttac 480 
cgagtacaca gtggtggatg aatcttctgt tgctaagatt gatgatgcag ctcctcctga 540 
gaaagtctgt ttaattggct gtgggttttc cactggatat ggcgctgctg t 591 

<210> 244 
<211> 594 
<212> DNA 

<213> Homo sapiens 



<400> 244 
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gaattcggca cgagaacaga gtgaactgag 
aatacagatc caagacaaag acaggatggg 
agctgtgctt tgggagcaga agcaaccctt 
aaagactaaa gaagttcgca ttaagatttt 
tgtgataaaa ggaacaatgg tgtccaagtt 
gattgtagag agcattggag aaggagtgac 
tctctttctg ccacaatgta gagaatgcaa 
cattaggagc gatattactg gtcgtggagt 
caagggcaaa ccagtccacc acttcatgaa 
ggatgaatct tctgttgcta agattgatga 

<210> 245 
<211> 615 
<212> DNA 
<213> Homo sapiens 



catcagtcag aaaaagtcta tgtttgcaga 60 
cactgctgga aaagttatta aatgcaaagc 120 
ctccattgag gaaatagaag ttgccccacc 180 
ggccacagga atctgtcgca cagatgacca 240 
tccagtgatt gtgggacatg aggcaactgg 300 
tacagtgaaa ccaggtgaca aagtcatccc 360 
tgcttgtcgc aacccagatg gcaacctttg 420 
actggctgat ggcaccacca gatttacatg 480 
caccagtaca tttaccgagt acacagtggt 540 
tgcagctcct cctgagaaag tctg 594 



<220> 

<221> misc_feature 
<222> 105 

<223> n = A,T,C or G 
<400> 245 

gtccctttcc tctgctgccg ctcggtcacg cttgtgcccg aaggaggaaa cagtgacaga 60 

cctggagact gcagttctct atccttccac agctctttca ccatnctgga tcacttcctt 120 

tgaatgcaga agcttgctgg ccaaaagatg tgggaattgt tgcccttgag atctattttc 180 

cttctcaata tgttgatcaa gcagagttgg aaaaatatga tggtgtagat gctggaaagt 240 

ataccattgg cttgggccag gccaagatgg gcttctgcac agatagagaa gatattaact 300 

ctctttgcat gactgtggtt cagaatctta tggagagaaa taacctttcc tatgattgca 360 

ttgggcggct ggaagttgga acagagacaa tcatcgacaa atcaaagtct gtgaagacta 420 

atttgatgca gctgtttgaa gagtctggga atacagatat agaaggaatc gacacaacta 480 

atgcatgcta tggaggcaca gctgctgtct tcaatgcttg ttaactggat tgagtccagc 540 

tcttgggatg gacggtatgc cctggtaagt tgcaggagat attgctgtat atgccacagg 600 
aaatgctaga cctac 615 



<210> 246 
<211> 546 
<212> DNA 
<213> Homo sapiens 

<400> 246 

gaattcggca ccaggctgcc tcccgctcgc 
cggccagctc gccttattta gtgtctctgc 
tgaccgctct tggtttgaat ctggtcgctt 
ctggtctggc agtcagagat gtctctgagt 
gtgtgaaaac tttgcatcct gcagtccatg 
ataatgctga catggccaga cttgatttca 
atccctttgt aaagacagtg gcttctccag 
ttgacattgg tggagtaacc ttactgagag 
tggtgtgtga accagaggac tatgtgggtg 
taagga 

<210> 247 
<211> 564 
<212> DNA 
<213> Homo sapiens 



cctgaaccca gtgcctgcag ccatggctcc 60 
aaaaccggcc ttgtgaattt gcaagaaacc 120 
ccggagggac tgcaaaagct ctcagggatg 180 
tgacgggatt tcctgaaatg ttggggggac 240 
ctggaatcct agctcgtaat attccagaag 300 
atcttataag agttgttgcc tgcaatctct 360 
gtgtaactgt tgaggaggct gtggagcaaa 420 
ctgcagccaa aaaccacgct cgagtgacag 480 
ggtgtccacg gagatgcaga gctccgagag 540 
546 



<400> 247 

gaattcggca ccagagatca cgtgcagtga gatgcagcaa aaagttgaac ttctgagata 60 
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tgaatctgaa aagcttcaac aggaaaattc 
tgaagaagat agcatttcta acctgaaatt 
gtggcaaaaa acggaaactg taaaacaaga 
tttaaagaaa cagatttcag aattaaaaat 
agaacttagc caaaagaact ctcaaaacca 
aacagaaatg ctatgccaga aggaaaaaga 
acaagagaag tttaatctga aagaagaact 
agtgtcttct ctggaggcgg agctctctga 
ggaaaaccac cttctcaaag atga 

<210> 248 
<211> 434 
<212> DNA 
<213> Homo sapiens 



tattttgaga aatgaaatta ctactttaaa 120 
agggacatta aatggatctc aggaagaaat 180 
aaatgctgca gttcagaaga tggttgaaaa 240 
caaaaaccaa caattggatt tggaaaatac 300 
ggaaaaactg caagaactta atcaacgtct 360 
gccaggaaac agtgcattgg aggaacggga 420 
ggaacgttgt aaagtgcagt cctccacttt 480 
agttaaaata cagacccata ttgtgcaaca 540 
564 



<400> 248 

gttcttgttt gtggatcgct gtgatcgtca cttgacaatg cagatcttcg tgaagactct 60 
gactggtaag accatcaccc tcgaggttga gcccagtgac accatcgaga atgtcaaggc 120 
aaagatccaa gataaggaag gcatccctcc tgaccagcag aggctgatct ttgctggaaa 180 
acagctggaa gatgggcgca ccctgtctga ctacaacatc cagaaagagt ccaccctgca 240 
cctggtgctc cgtctcagag gtgggatgca aatcttcgtg aagacactca ctggcaagac 300 
catcaccctt gaggtggagc ccagtgacac catcgagaac gtcaaagcaa agatccagga 360 
caaggaaggc attcctcctg accagcagag gttgatcttt gccggaaagc cagcctggga 420 
agatggggcc gcca 434 

<210> 249 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 249 

gcgggcccag gaggcggcgg cggcggcggc ggacgggccc cccgcggcag acggcgagga 60 

cggacaggac ccgcacagca agcacctgta cacggccgac atgttcacgc acgggatcca 120 

gagcgccgcg cacttcgtca tgttcttcgc gccctggtgt ggacactgcc agcggctgca 180 

gccgacttgg aatgacctgg gagacaaata caacagcatg gaagatgcca aagtctatgt 240 

ggctaaagtg gactgcacgg cccactccga cgtgtgctcc gcccaggggg tgcgaggata 300 

ccccacctta aagcttttca agccaggcca agaagctgtg aagtaccagg gtcctcggga 360 

cttccagaca ctggaaaact ggatgctgca gacactgaac gaggagccag tgacac 416 

<210> 250 
<211> 504 
<212> DNA 
<213> Homo sapiens 



<400> 250 

gaattcggca cgaggcgggt aacgttatag 
attgtgatcc gtcccaggca gtggattagg 
aggctgagat ggatcctctc agggcccaac 
tggccgatat gtacaacaga atgaccagtg 
acaaggaagc agagctctcc aagggcgagt 
acctggacat ccatgagcgg atgggcaaaa 
agctgatgaa gagggtgcag cagagctctg 
cctggggtgt accccacccc ttcccacttt 
gtgaagactg ccaggcctag ctct 



tatttgtcag aagttggggt ctccgtgggc 60 
aggccagaag gagatccctt ccacggtgct 120 
agctggctgc ggagctggag gtggagatga 180 
cctgccaccg gaagtgtgtg cctcctcact 240 
ctgtgtgcct ggaccgatgt gtctctaagt 300 
agttgacaga gttgtctatg caggatgaag 360 
ggcctgcatg aggtccctgt cagtatacac 420 
aataaacgtg ctccctgttg ggtgtcatct 480 
504 



<210> 251 
<211> 607 
<212> DNA 
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<213> Homo sapiens 



<400> 251 

gatgaaaata cacaatttta ctagcaaatg 
tactctgctc aaccatatgt taattcatgg 
caatgatgtg gaaaagatgg ccgcacacat 
acctaaaaca gattctactt tgagttttga 
catccatctc ctggtaacta catacaatct 
ccatgcccaa aataatcctc cagttcctcc 
agatatccct gtaaaaagtt cacctcaagc 
aaccctttgt cctctttgct tttcaatcct 
tcacttacga gagaggcacc aagttattca 
ctacaaatgt atccattgcc ttggtgtgta 
tctgcat 

<210> 252 
<211> 618 
<212> DNA 
<213> Homo sapiens 



cctctactgt aatcgctatt tacccacaga 60 
tctgtcttgt ccatattgcc gttcaacttt 120 
gcggatggtt cacattgatg aagagatggg 180 
tttgacattg cagcagggta gtcacactaa 240 
gagggatgcc ccagctgaat ctgttgctta 300 
aaagccacag ccaaaggttc aggaaaaggc 360 
tgcagtgccc tataaaaaag atgttgggaa 420 
aaaaggaccc atatctgatg cacttgcaca 480 
gacggttcat ccagttgaga aaaagctcac 540 
taccagcaac atgaccgcct caactatcac 600 
607 



<400> 252 

gaattcgcac caggggtcct gctggtcttc 
cgctgccact ggggtccctg gccccaccga 
cgcacggagc tgaacaagct gcccaagtct 
gatcagcaat ccgagatcga tggcctgaag 
gaacaacagt attttgaaat agaaaagagg 
gaaacccgag agtgtcaaag cttgcggctt 
gcactaactg agaaaaacaa agaacttgaa 
agccaattta caagaacaaa ggaagaatta 
aatgagagac tatctcaaga acttgaatac 
aaacttaaag aaagcaatac aacaaagggt 
gcttctgatg tttctgtt 

<210> 253 

<211> 1201 

<212> DNA 

<213> Homo sapiens 



gcctttcttc tccgcttcta ccccgtcggc 60 
catggcggcg gtgttgagca agtcctggag 120 
gtccagaaca aacttgaaaa gttccttgct 180 
gggcggcatg agaaatttaa ggtggagagc 240 
ttgtcccaca gtcaggagag acttgtgaat 300 
gagctagaga aactcaacaa tcaactgaag 360 
attgctcagg atcgcaatat tgccattcag 420 
gaagctgaga aaagagactt aattagaacc 480 
ttaacagagg atgttaaacg tctgaatgaa 540 
gaacttcagt taaaattgga tgaacttcaa 600 
618 



<400> 253 

gaattcggca ccagggtggc gagcgcggct 
tgtgagtttg gcatgatttg gtcccctggg 
tacttcctgg aagaaaacta aaacaataca 
ccccttacaa atatggacac atttcctagc 
aatcctgagc ctgagttcca aaatatgtta 
aaacataatt atcaggctct gaaaattgaa 
tcacaaaatg aacttaaacg tgtattaatt 
ctgctccttg aagacttaag gggagaatta 
aaactgcagg tactaacacc acaaaaattg 
ttagaagctc caatgcgaga acgttttcgg 
gctgagtata acaagctgcg ctacgagtat 
aaagaagagt ttactcgggt ttcagaagaa 
cgactggaga aggacaaaga ggagctacat 
gacagcaaac gaatggagca acttgttcga 
agtttagagg ctgaagtagc agaattaagg 
gaaaatgtcc aaagaataca ggtgaggcag 
ttggaggctg aaaagcagtc agctaaacta 
tcaagcaatg aacagaatac ctgcttaatc 
agcacactgg ccagtgaagt gaaagagctt 



gctgtgctgg ggcgagcagc ggggaccgtg 60 
attctgcctt agcaagaaag aagttggaaa 120 
aaagccacag cttattgatt gcatgtcagc 180 
ctatttccac ctggaggaga tagtaggctg 240 
attgatgaaa gggtacgctg tgaacatcat 300 
cacaaaaggt tgcaggaaga atatgtaaaa 360 
gaaaagcaag caagccagga aaaattccaa 420 
gtagagaaag ctagagacat agaaaaaatg 480 
gaattggtaa aagcccaact acaacaagaa 540 
actcttgatg aagaagtgga aaggtacaga 600 
acatttctca agtcagagtt tgaacaccag 660 
gagaaaatga aatacaagtc agaggttgca 720 
aaccagctgc ttagtgttga tcccacgaga 780 
gaaaaaaccc atttgcttca gaaattgaaa 840 
gctgagaaag aaaattctgg tgctcaggta 900 
ttggctgaga tgcaggctac actcagatcc 960 
caagctgagc gtttagaaaa agaactacaa 1020 
agcaaactgc atagagctga ccgagaaatc 1080 
aaacatgcaa acaaactaga aataactgac 1140 
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atcaaactgg aggcagcaag agctaagagt gagctcgaaa gagaaaggaa taagatccaa 1200 
a 1201 

<210> 254 
<211> 560 
<212> DNA 
<213> Homo sapiens 

<400> 254 

gaattcggca ccagtttggg gggtgaggtt taattggaaa tggtctctgg ggactgaaaa 60 
ctgatgtttt tgcagattac ctcagggaaa cggaggtttg ttgagttaca gacacattaa 120 
accaaaggcc gtgggaaaac ccctctccag ctccagggga ttggtcagga ccacccacta 180 
accagtgcct tccttcttaa cattcacttt tagcagcttg tgtttatttt acatgggcag 240 
ttttgatggg aaattgccat gaccacaggg gtttggagtt ctgctttttt tttttcttct 300 
tctttttcgg gggactgggg gactcctccc aagatcacat tttagcatct ttctctccta 360 
ctccatttag aaaaataagt aacaggtgaa atgtggtctc agtgttaacg ggataattct 420 
gctaccggct cctccctgat gattctgaaa tacactactg aacgagctct ggctggtcct 480 
ttctatcctg gatgtggttc ttctgtgtag caattccttg atgtccagtt tggaaagatg 540 
tactcttctc aacaagaaaa 560 

<210> 255 
<211> 612 
<212> DNA 
<213> Homo sapiens 

<400> 255 

gaattcggca ccaggcgggg cagcagggcc gcggccatgg ggagcttgaa ggaggagctg 60 
ctcaaagcca tctggcacgc cttcaccgac tcgaccagga ccacagggca aggtctccaa 120 
gtcccagctc aaggtccttt cccataacct gtgcacggtg ctgaaggttc ctcatgaccc 180 
agttgccctt gaagagcact tcagggatga tgatgagggt ccagtgtcca accagggcta 240 
catgccttat ttaaacaggt tcattttgga aaaggtccaa gacaactttg acaagattga 300 
attcaatagg atgtgttgga ccctctgtgt caaaaaaaaa cctcacaaag aatcccctgc 360 
tcattacaga agaagatgca tttaaaatat gggttatttt caacttttta tctgaggaca 420 
agtatccatt aattattgtg tcagaagaga ttgaatacct gcttaagaag cttacagaag 480 
ctatgggagg aggttggcag caagaacaat ttgaacatta taaaatcaac tttgatgaca 540 
gtaaaaatgg cctttctgca tgggaactta ttgagcttat tggaaatgga cagtttagca 600 
aaggcatgga cc 612 

<210> 256 

<211> 1132 

<212> DNA 

<213> Homo sapiens 

<400> 256 

gaattcggca cgaggtctgg gagaggcctc tggagcagga ggcccagtgg ctcttctgac 60 

ccaaggcccc gccgtccagc ttctaagtgc cagatgatgg aggagcgtgc caacctgatg 120 

cacatgatga aactcagcat caaggtgttg ctccagtcgg ctctgagcct gggccgcagc 180 

ctggatgcgg accatgcccc cttgcagcag ttctttgtag tgatggagca ctgcctcaaa 240 

catgggctga aagttaagaa gagttttatt ggccaaaata aatcattctt tggtcctttg 300 

gagctggtgg agaaactttg tccagaagca tcagatatag cgactagtgt cagaaatctt 360 

ccagaattaa agacagctgt gggaagaggc cgagcgtggc tttatcttgc actcatgcaa 420 

aagaaactgg cagattatct gaaagtgctt atagacaata aacatctctt aagcgagttc 480 

tatgagcctg aggctttaat gatggaggaa gaagggatgg tgattgttgg tctgctggtg 540 

ggactcaatg ttctcgatgc caatctctgc ttgaaaggag aagacttgga ttctcaggtt 600 

ggagtaatag atttttccct ctaccttaag gatgtgcagg atcttgatgg tggcaaggag 660 

catgaaagaa ttactgatgt ccttgatcaa aaaaattatg tggaagaact taaccggcac 720 

ttgagctgca cagttgggga tcttcaaacc aagatagatg gcttggaaaa gactaactca 780 

aagcttcaag aagagctttc agctgcaaca gaccgaattt gctcacttca agaagaacag 840 



